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TITLE OF THE INVENTION 

DNA MOLECULES ENCODING HUMAN NUCLEAR 
RECEPTOR PROTEINS, nNR7 and nNR7-l 

5 

CROSS-REFERENCE TO RELATED APPLICATIONS 

Provisional application 60/104,251 filed October 14, 1998, 
which is a continuation of Provisional application 60/069,401 filed 
December 12, 1997. 

10 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not applicable. 

15 

REFERENCE TO MICROFICHE APPENDIX 
Not applicable. 



20 FIELD OF THE INVENTION 

The present invention relates in part to isolated nucleic acid 
molecules (polynucleotide) which encodes a human nuclear receptor 
proteins, referred to throughout as nNR7 and nNR7-l, respectively. The 
present invention also relates to recombinant vectors and recombinant 

25 hosts which contain a DNA fragment encoding nNR7 and nNR7-l, 
substantially purified forms of associated human nNR7 protein and 
associated human nNR7-l protein, human mutant proteins of nNR7 
and nNR7-l, and methods associated with identifying compounds which 
modulate nNR7 and nNR7-l activity. 



30 
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BACKGROUND OF THE INVENTION 

The nuclear receptor superfamily, which includes steroid 
hormone receptors, are small chemical ligand-inducible transcription 
5 factors which have been shown to play roles in controlling development, 
differentiation and physiological function. Isolation of cDNA clones 
encoding nuclear receptors reveal several characteristics. First, the 
NH2-terminal regions, which vary in length between receptors, is 

hypervariable with low homology between family members. There are 
10 three internal regions of conservation, referred to as domain I, II and 
III. Region I is a cysteine-rich region which is referred to as the DNA 
binding domain (DBD). Regions II and III are within the COOH- 
terminal region of the protein and is also referred to as the ligand 
binding domain (LBD). For a review, see Power et al. (1992, Trends in 
15 Pharmaceutical Sciences 13: 318-323). 

The lipophilic hormones that activate steroid receptors are 
known to be associated with human diseases. Therefore, the respective 
nuclear receptors have been identified as possible targets for therapeutic 
intervention. For a review of the mechanism of action of various steroid 
20 hormone receptors, see Tsai and O'Malley (1994, Annu. Rev. Biochem. 
63: 451-486). 

Recent work with non-steroid nuclear receptors has also 
shown the potential as drug targets for therapeutic intervention. This 
work reports that peroxisome proliferator activated receptor g (PPARg), 

25 identified by a conserved DBD region, promotes adipocyte differentiation 
upon activation and that thiazolidinediones, a class of antidiabetic 
drugs, function through PPARg (Tontonoz et al., 1994, Cell 79: 1147-1156; 
Lehmann et al., 1995, J. Biol Chem. 270(22): 12953-12956; Teboul et al., 
1995, J. Biol Chem. 270(47): 28183-28187). This indicates that PPARg 

30 plays a role in glucose homeostasis and lipid metabolism. 

Baes et al. (1994, Mol Cell Biol. 14(3): 1544- 1552) disclose a 
cDNA clone which encodes a new member of the nuclear receptor 
superfamily, referred herein as hONX. The authors present data 
showing that the gene is expressed mainly in human liver and suggests 

35 that this nuclear receptor may play a role in regulating expression of 
retinoid- responsive genes. 
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Baker, et ai. (1988, Proc. Natl Acad. Sci. (U.S.A.) 85 (10): 
3294-3298) disclose a cDNA clone which encodies human vitamin D 
receptor (hVDR). 

Mangelsdorf et al. (1995, Cell 83: 835-839) provide a review of 
5 known members of the nuclear receptor superfamily, including hONX 
and hVDR. 

It would be advantageous to identify additional genes which 
are members of the nuclear receptor superfamily, especially vertebrate 
members from such species as human, rat and mouse. A nucleic acid 

10 molecule expressing a nuclear receptor protein will be useful in 

screening for compounds acting as a modulator of cell differentiation, 
cell development and physiological function. The present invention 
addresses and meets these needs by disclosing isolated nucleic acid 
molecules which express a human nuclear receptor protein which will 

15 have a role in cell differentiation and development, 

SUMMARY OF THE INVENTION 

The present invention relates to isolated nucleic acid 
molecules (polynucleotides) which encode novel nuclear receptor 

20 proteins which are herein designated as members of the nuclear 
receptor superfamily. The isolated polynucleotides of the present 
invention encode vertebrate members of this nuclear receptor 
superfamily, and preferably human nuclear receptor proteins, such as 
human nuclear receptor proteins exemplified and referred to 

25 throughout this specification as nNR7 and/or nNR7-l. The nuclear 
receptor proteins encoded by the isolated polynucleotides of the present 
invention are involved in the regulation of in vivo cell proliferation 
and/or cell development. Based on amino acid sequence homology, the 
nuclear receptor that is most related to human nNR7 and/or nNR7-l 

30 with known function is the vitamin D receptor. Northern analysis on 
the following human tissue samples -- heart, brain, placenta, lung, 
liver,skeletal muscle, kidney, pancreas, adrenal medulla, thyroid, 
adrenal cortex, testis, thymus, small intestine and stomach -- reveals 
that nNR7 and/or nNR7-l is expressed mainly in liver at medium to low 

35 level and the small intestine. It may be expressed at much higher level 
in other tissues not examined. This data suggest that nNR7 and/or 
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nNR7-l plays important roles in carrying out metabolic functions 
involving D vitamins, since the liver is the major site for generation of 
hydroxylated D vitamins, which are active forms of vitamin D for the 
vitamin D receptor. It is also possible that other vitamin D metabolites 
5 may be active forms for nNR7 and/or nNR7-l in the liver. 

As noted above, nNR7 and/or nNR7-l and are expressed in 
the liver and small intestine. In humans, the cytochrome P-450 
monooxygenase 3A4 CCYP3A4) is mainly expressed in the liver and 
small intestine. The CYP3A4 protein plays an important role in the 

10 biotransformation of drugs, including more than 60% of all clinically 
used drugs, and its expression level is markedly induced by those 
compounds. Therefore, assays that measure the effects of compounds 
on CYP3A4 gene expression can predict whether drugs will interact in 
humans. Because the molecular mechanism underlying this induction 

15 is unclear, CYP3A4 gene induction assays have been almost exclusively 
dependent upon the use of human liver tissue and primary hepatocytes 
to date. The nNR7 and/or nNR7-l nuclear receptor disclosed in this 
application has been disclosed by Lehmann et al. (1998, J. Clin. Invest 
102: 1016-1023) subsequent to the priority filing date of this specification. 

20 The authors identified a response element located in the CYP3A4 

promoter [5-TGAACT caaagg AGGTCA-3' (SEQ ID NO:24)] that was 
shown to bind nNR7 (refered to as hPXR by the authors, but lacking in 
amino acid 1-32 of nNR7 [SEQ ID NO:2]). The authors suggest that 
drugs which induce CYP3A4 gene expression activate nNR7 and initiate 

25 transcription through CYP3A4 promoter. One of the uses of the DNA 
molecules and concomitantly expressed proteins of the present 
invention, including but not limited to nNR7 and nNR7-l, will be useful 
in assays to identify modulators of CPP3A4 levels in vivo. Therefore, 
transactivation assays using nNR7 and/or nNR7-l and theCYP3A4 

30 promoter linked to a reporter gene (such as SEAP secreted placental 
alkaline phosphatase) is one approach for identifying modulators of 
CYP3A4 levels in vivo. 

The present invention also relates to isolated nucleic acid 
fragments which encode mRNA expressing a biologically active novel 

35 vertebrate nuclear receptor which belongs to the nuclear receptor 
superfamily. 
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A preferred embodiment relates to isolated nucleic acid 
fragments of SEQ ID NO: 1 and/or SEQ ID NO:17 which encode mRNA 
expressing a biologically functional derivative of nNR7 and/or nNR7-l, 
respectively. Any such nucleic acid fragment will encode either a 
5 protein or protein fragment comprising at least an intracellular DNA- 
binding domain and/or ligand binding domain, domains conserved 
throughout the human nuclear receptor family domain which exist in 
nNR7 (SEQ ID NO: 2) and/or nNR7-l (SEQ ID NO:18). Any such 
polynucleotide includes but is not necessarily limited to nucleotide 

10 substitutions, deletions, additions, amino-terminal truncations and 

carboxy-terminal truncations such that these mutations encode mRNA 
which express a protein or protein fragment of diagnostic, therapeutic 
or prophylactic use and would be useful for screening for agonists 
and/or antagonists of nNR7 and/or nNR7-l. 

15 The isolated nucleic acid molecules of the present invention 

may include a deoxyribonucleic acid molecule (DNA), such as genomic 
DNA and complementary DNA (cDNA), which may be single (coding or 
noncoding strand) or double stranded, as well as synthetic DNA, such 
as a synthesized, single stranded polynucleotide. The isolated nucleic 

20 acid molecule of the present invention may also include a ribonucleic 
acid molecule (RNA). 

The present invention also relates to recombinant vectors 
and recombinant hosts, both prokaryotic and eukaryotic, which contain 
the substantially purified nucleic acid molecules disclosed throughout 

25 this specification. 

A preferred aspect of the present invention is disclosed in 
Figure 1A-C and SEQ ID NO: 1, an isolated human cDNA encoding a 
novel nuclear trans- acting receptor protein, nNR7. 

An especially preferred aspect of the present invention is 

30 disclosed in Figure 4A-C and SEQ ID NO: 17, an isolated human cDNA 
encoding a novel nuclear trans-acting receptor protein, nNR7-l. 

Another preferred aspect of the present invention relates to 
a substantially purified form of the novel nuclear trans- acting receptor 
protein, nNR7, which is disclosed in Figure 2A-C and Figure 3 and as 

3 5 set forth in SEQ ID NO: 2. 
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Another especially preferred aspect of the present invention 
relates to a substantially purified form of the novel nuclear trans-acting 
receptor protein, nNR7-l, which is disclosed in Figure 5A-C and 
Figure 6 and is set forth as SEQ ID NO: 18. The receptor protein nNR7-l 
5 contains an amino terminal addition compared to nNR7; specifically 
wherein nNR7-l contains an intitiating methionine residue as the NH2- 
terminal amino acid residue. 

The present invention also relates to biologically functional 
derivatives of nNR7 as set forth as SEQ ID NO: 2, including but not 
10 limited to nNR7 mutants and biologically active fragments such as 
amino acid substitutions, deletions, additions, amino terminal 
truncations and carboxy- terminal truncations, such that these 
fragments provide for proteins or protein fragments of diagnostic, 
therapeutic or prophylactic use and would be useful for screening for 
15 agonists and/or antagonists of nNR7 function. 

The present invention also relates to polyclonal and 
monoclonal antibodies raised in response to either the human form of 
nNR7 and/or nNR7-l disclosed herein, or a biologically functional 
derivative thereof. It will be especially preferable to raise antibodies 
20 against epitopes within the NH2-terminal domain of nNR7 and/or 

nNR7-l, which show the least homology to other known proteins 
belonging to the human nuclear receptor superfamily. To this end, the 
DNA molecules, RNA molecules, recombinant protein and antibodies of 
the present invention may be used to screen and measure levels of 

25 human nNR7 and/or nNR7-l. The recombinant proteins, DNA 
molecules, RNA molecules and antibodies lend themselves to the 
formulation of kits suitable for the detection and typing of human nNR7 
and/or nNR7-l . 

The present invention also relates to isolated nucleic acid 

30 molecules which are fusion constructions expressing fusion proteins 
useful in assays to identify compounds which modulate wild-type 
human nNR7 and/or nNR7-l activity. A preferred aspect of this portion 
of the invention includes, but is not limited to, glutathione S- transferase 
GST-nNR7 and/or GST-nNR7-l fusion constructs. These fusion 

35 constructs include, but are not limited to, all or a portion of the ligand- 
binding domain of nNR7 and/or nNR7-l, respectively, as an in-frame 
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fusion at the carboxy terminus of the GST gene. The disclosure of SEQ 
ID NOs:l, 2, 17 and 18 allow the artisan of ordinary skill to construct any 
such nucleic acid molecule encoding a GST-nuclear receptor fusion 
protein. Soluble recombinant GST-nuclear receptor fusion proteins may 
5 be expressed in various expression systems, including Spodoptera 
frugiperda (Sf21) insect cells (Invitrogen) using a baculovirus 
expression vector (e.g., Bac-N-Blue DNA from Invitrogen or pAcG2T 
from Pharmingen). Another preferred aspect of this portion of the 
invention relates to the use of a receptor/reporter system to identify 

10 modulators of nNR7, nNR7-l and/or CYP3A4 levels in vivo. As an 

example, and not as a hmitatdon, the portion of SEQ ID NO:l or SEQ ID 
NO: 17 which encodes the open reading frame of nNR7 and nNR7-l, 
respectively, may be fused downstream of an active promoter, such as a 
CMV promoter fragment, so as to overexpress a receptor protein. A 

15 second construct includes regulatory regions from the human CYP3A4 
gene (such as the human CYP3A4 promoter region or a response 
element [e.g., SEQ ID NO:24]) fused to a reporter gene, such as SEAP 
(secreted placental alkaline phosphatase). These constructs may then 
be utilized in a transactivation assay to identify modulators of CYP3A4 

20 levels in vivo. 

It is an object of the present invention to provide an isolated 
nucleic acid molecule which encodes a novel form of a nuclear receptor 
protein such as human nNR7 and/or nNR7-l, human nuclear receptor 
protein fragments of full length proteins such as nNR7 and/or nNR7-l, 

25 and mutants which are derivatives of SEQ ID NO: 2 and/or SEQ ID 
NO: 18, respectively. Any such polynucleotide includes but is not 
necessarily limited to nucleotide substitutions, deletions, additions, 
a mi no- terminal truncations and carboxy- terminal truncations such 
that these mutations encode mRNA which express a protein or protein 

30 fragment of diagnostic, therapeutic or prophylactic use and would be 
useful for screening for agonists and/or antagonists for nNR7 and/or 
nNR7-l function. 

Another object of this invention is tissue typing using 
probes or antibodies of this invention. In a particular embodiment, 

35 polynucleotide probes are used to identify tissues expressing nNR7 

and/or nNR7-l mKNA. In another embodiment, probes or antibodies 
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can be used to identify a type of tissue based on nNR7 and/or nNR7-l 
expression or display of nNR7 and/or nNR7-l receptors. 

It is a further object of the present invention to provide the 
human nuclear receptor proteins or protein fragments encoded by the 
5 nucleic acid molecules referred to in the preceding paragraph. 

It is a further object of the present invention to provide 
recombinant vectors and recombinant host cells which comprise a 
nucleic acid sequence encoding human nNR7 and/or nNR7-l or a 
biological equivalent thereof. 
10 It is an object of the present invention to provide a 

substantially purified form of nNR7 and/or nNR7-l, as set forth in SEQ 
ID NO: 2 and SEQ ID NO: 18, respectively. 

It is an object of the present invention to provide for 
biologically functional derivatives of nNR7 and/or nNR7-l, including but 
15 not necessarily limited to amino acid substitutions, deletions, additions, 
amino terminal truncations and carboxy-terminal truncations such 
that these fragment and/or mutants provide for proteins or protein 
fragments of diagnostic, therapeutic or prophylactic use. 

It is also an object of the present invention to provide for 
20 in-frame fusion constructions of nNK7 and/or nNR7-l and the like, 
methods of expressing these fusion constructions and biological 
equivalents disclosed herein, related assays, recombinant cells 
expressing these constructs and agonistic and/or antagonistic 
compounds identified through the use DNA molecules encoding human 
25 nuclear receptor proteins such as nNR7 and/or nNR7-l. 

As used herein, "DBD" refers to DNA binding domain. 
As used herein, "LBD" refers to ligand binding domain. 
As used herein, the term "mammalian host" refers to any 
mammal, including a human being. 

30 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1A-C shows the nucleotide sequence which 
comprises the open reading frame encoding the human nuclear 
receptor protein, nNR7 (SEQ ID NO: 1). 
35 Figure 2A-C shows the nucleotide sequence of the isolated 

cDNA molecule (SEQ ID NO: 1) which encodes nNR7, and the amino 
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acid sequence of (SEQ ID NO: 2) nNR7. The region in bold and underline 
is the DNA binding domain. 

Figure 3 shows the amino acid sequence of nNR7 (SEQ ID 
NO: 3). The region in bold and underline is the DNA binding domain. 
5 Figure 4A-C shows the nucleotide sequence which 

comprises the open reading frame encoding the human nuclear 
receptor protein, nNR7 (SEQ ID NO: 1). The region of nucleotides which 
are underlined represent the extended 5'-end of the nNR7 cDNA clone 
(as shown in SEQ ID NO:l and Figure 1A-C). 

10 Figure 5A-C shows the nucleotide sequence of the isolated 

cDNA molecule (SEQ ID NO: 17) which encodes nNR7-l, and the amino 
acid sequence of (SEQ ID NO:18) nNR7-l. The region of nucleotides 
which are underlined represent the extended 5 -end of the nNR7 cDNA 
clone (as shown in SEQ ID NO:l and Figure 1A-C). The amino acid 

15 region underlined represents the DNA binding domain. 

Figure 6 shows the amino acid sequence of nNR7-l (SEQ ID 
NO: 18). The region in bold is the DNA binding domain. The underline 
indicates the extended N-terminus of nNR7-l as compared to nNR7. 

20 DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to isolated nucleic acid and 
protein forms which represent nuclear receptors, preferably but not 
necessarily limited to human receptors. These expressed proteins are 
novel nuclear receptors and which are useful in the identification of 

25 downstream target genes and ligands regulating their activity. The 

nuclear receptor proteins encoded by the isolated polynucleotides of the 
present invention are involved in the regulation of in vivo cell 
proliferation and/or cell development. The nuclear receptor superfamily 
is composed of a group of structurally related receptors which are 

30 regulated by chemically distinct ligands. The common structure for a 
nuclear receptor is a highly conserved DNA binding domain (DBD) 
located in the center of the peptide and the ligand-binding domain (LBD) 
at the COOH- terminus. Eight out of the nine non- variant cysteines form 
two type II zinc fingers which distinguish nuclear receptors from other 

35 DNA-binding proteins. The DBDs share at least 50% to 60% amino acid 
sequence identity even among the most distant members in vertebrates. 

-9- 
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The superfamily has been expanded within the past decade to contain 
approximately 25 subfamilies. A first exemplified cDNA encoding 
human nNR was identified via low stringency screening on a mixed 
cDNA library. The probes used in the screening were DNA fragments 
5 corresponding to AR, ERb, GR and VDR DNA binding domain regions 
and 250,000 primary clones from the mixed library were screened. After 
two rounds of screening, in vivo excision was carried out on the purified 
1ZAPII phages which showed positive signals at low stringency 
condition. Plasmid DNA samples were submitted to sequence directly 

10 using M13 forward and reverse primers. A blast search of the sequence 
information was obtained using computer program PhredPhrap. A 
single cDNA clone, referred to as gm6, is shown herein to encode a novel 
human nuclear receptor, nNR7. DNA sequencing was continued from 
both 5' and 3' ends on nNR7. An intron was identified when sequencing 

15 with pgm6y2 (5'-CTTCAATGTCATGACATG-3'; SEQ ID NO: 3). A 

cDNA fragment flanking the intron region was retrieved via PCR using 
primer pair of NR7F (5-CCAAATCTGCCGTGTATGTG-3'; SEQ ID NO: 
4) and pgm6xC (S'-GTCAGTGCACTCTCCACGT-S*; SEQ ID NO: 5) 
followed by double nested primers of pgm6y2 and pgm6xD 

20 (S'-TGCAGCTGGTCCACCACGCG-S'; SEQ ID NO: 6). A prominent 
DNA fragment of -1.5 kb was amplified form testis and brain cDNAs. 
The fragment was purified and subcloned into pCRII vector (Invitrogen, 
San Diego, CA). Automated sequencing was performed on several of the 
clones. The complete sequencing of gm6 clone revealed a 3.1 kb cDNA 

25 which codes an authentic novel nuclear receptor. The open reading 
frame for the expressed protein does not contain an initiating 
methionine. 

Primer pair pgm6xAR (5'- GGGTATGCTCTGTGACAAG- 
3'; SEQ ID NO: 7) and pgm6x (5'-AGGCAGGCACTTTCATACC-3'; SEQ 

30 ID NO: 8) in 3' non-coding region were used to scan the 83 clones of the 
Stanford radiation hybrid panel (Cox et al., 1990, Science, 250:245:250). 
The PCR results were scored and submitted to the Stanford Genome 
Center for linkage analysis. The result indicated that nNR7 was located 
on chromosome 3. Northern analysis on Clontech blots showed that 

35 nNR7 was mainly expressed in human liver at medium to low level. 
The nNR7 polypeptide has a 42% sequence identity at the amino acid 

- 10- 
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level with human VDR (Baker, et al., 1988, Proc. Natl Acad. ScL 
(U.SJi.) 85 (10): 3294-3298) in the overlapping regions. Additionally, the 
nNR7 polypeptide has a 52% sequence identity at the amino acid level 
with human hONR (Baes et al.,1994, Mol. Cell Biol. 14(3): 1544- 1552) in 
5 the overlapping regions. 

The present invention also relates to isolated nucleic acid 
fragments of nNE7 (SEQ ID NO: 1) which encode mRNA expressing a 
biologically active novel human nuclear receptor. Any such nucleic acid 
fragment will encode either a protein or protein fragment comprising at 
10 least an intracellular DNA-binding domain and/or ligand binding 
domain, domains conserved throughout the human nuclear receptor 
family domain which exist in nNR7 (SEQ ID NO: 2). Any such 
polynucleotide includes but is not necessarily limited to nucleotide 
substitutions, deletions, additions, amino- terminal truncations and 
15 carboxy-terminal truncations such that these mutations encode mRNA 
which express a protein or protein fragment of diagnostic, therapeutic 
or prophylactic use and would be useful for screening for agonists 
and/or antagonists for nNR7 function. 

The isolated nucleic acid molecule of the present invention 
20 may include a deoxyribonucleic acid molecule (DNA), such as genomic 
DNA and complementary DNA (cDNA), which may be single (coding or 
noncoding strand) or double stranded, as well as synthetic DNA, such 
as a synthesized, single stranded polynucleotide. The isolated nucleic 
acid molecule of the present invention may also include a ribonucleic 
25 acid molecule (RNA). 

The present invention also relates to recombinant vectors 
and recombinant hosts, both prokaryotic and eukaryotic, which contain 
the substantially purified nucleic acid molecules disclosed throughout 
this specification. 

30 A preferred aspect of the present invention is disclosed in 

Figure 1A-C and SEQ ID NO: 1, a human cDNA encoding a novel 
nuclear trans-acting receptor protein, nNR7, disclosed as follows: 

TACGCCAAGC TCGAAATTAA CCCTCACTAA AGGGAACAAA AGCTGGAGCT 
CCACCGCGGT GGCGGCCGCT CTAGAACTAG TGGATCCCCC GGGCTGCAGG 
35 AATTCGAATT CTCATAACCT ATGACTAGGA CGGGAAGAGG AAGCACTGCC 
TTTACTTCAG TGGGAATCTC GGCCTCAGCC TGCAAGCCAA GTGTTCACAG 
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TGAGAAAAGC AAGAGAATAA 
GCTCCTTGGT AAAGCTACTC 
AAGTGGACCC CAGGGGAGAA 
CCAAGAGGCC CAGAAGCAAA 
5 CCATGCTGAC TTTGTACACT 
CCAGTGTCAA CGCAGATGAG 
TGTGGGGACA AGGCCACTGG 
ATGCAAGGGC TTTTTCAGGA 
GCCCCTTCCG GAAGGGCGCC 

10 TGCCAGGCCT GCCGCCTGCG 
GATGATCATG TCCGACGAGG 
GGAAGAAAAG TGAACGGACA 
ACAGAGGAGC AGCGGATGAT 
AACCTTTGAC ACTACCTTCT 

15 TGCTTAGCAG TGGCTGCGAG 
GAAGAAGCTG CCAAGTGGAG 
GGTCTCTCTG CAGCTGCGGG 
CCCCAGCCGA CAGTGGCGGG 
GCTGACATGT CAACCTACAT 

20 CATCTCCTAC TTCAGGGACT 
AGGGGGCCGC TTTCGAGCTG 
GCGGAGACTG GAACCTGGGA 
CACTGCAGGT GGCTTCCAGC 
ACTACATGCT GAAGAAGCTG 

25 CAGGCCATCT CCCTCTTCTC 
CGTGGTGGAC CAGCTGCAGG 
TTGAATGCAA TCGGCCCCAG 
ATGGCTATGC TCACCGAGCT 
GCTGCTGCGC ATCCAGGACA 

30 AGTTGTTCGG CATCACAGGT 
CGAGAGGCAG CCAGACCCAG 
ACAGATGGAC ACTGCCAAGA 
AGGGAATTCC TGCTATGACA 
GTGCCCCCCA CCCCCAGTTC 

35 ACGTGGAGAG TGCACTGACC 
TGCCCTTTCC TTTTAAAAGG 



GCTAATACTC CTGTCCTGAA CAAGGCAGCG 
CTTGATCGAT CCTTTGCACC GGATTGTTCA 
GTCGGAGCAA AGAACTTACC ACCAAGCAGT 
CCTGGAGGTG AGACCCAAAG AAAGCTGGAA 
GTGAGGACAC AGAGTCTGTT CCTGGAAAGC 
GAAGTCGGAG GTCCCCAAAT CTGCCGTGTA 
CTATCACTTC AATGTCATGA CATGTGAAGG 
GGGCCATGAA ACGCAACGCC CGGCTGAGGT 
TGCGAGATCA CCCGGAAGAC CCGGCGACAG 
CAAGTGCCTG GAGAGCGGCA TGAAGAAGGA 
CCGTGGAGGA GAGGCQGGCC TTGATCAAGC 
GGGACTCAGC CACTGGGAGT GCAGGGGCTG 
GATCAGGGAG CTGATGGACG CTCAGATGAA 
CCCATTTCAA GAATTTCCGG CTGCCAGGGG 
TTGCCAGAGT CTCTGCAGGC CCCATCGAGG 
CCAGGTCCGG AAAGATCTGT GCTCTTTGAA 
GGGAGGATGG CAGTGTCTGG AACTACAAAC 
AAAGAGATCT TCTCCCTGCT GCCCCACATG 
GTTCAAAGGC ATCATCAGCT TTGCCAAAGT 
TGCCCATCGA GGACCAGATC TCCCTGCTGA 
TGTCAACTGA GATTCAACAC AGTGTTCAAC 
GTGTGGCCGG CTGTCCTACT GCTTGGAAGA 
AACTTCTACT GGAGCCCATG CTGAAATTCC 
CAGCTGCATG AGGAGGAGTA TGTGCTGATG 
CCCAGACCGC CCAGGTGTGC TGCAGCACCG 
AGCAATTCGC CATTACTCTG AAGTCCTACA 
CCTGCTCATA GGTTCTTGTT CCTGAAGATC 
CCGCAGCATC AATGCTCAGC ACACCCAGCG 
TACACCCCTT TGCTACGCCC CTCATGCAGG 
AGCTGAGCGG CTGCCCTTGG GTGACACCTC 
AGCCCTCTGA GCCGCCACTC CCGGGCCAAG 
GCCGACAATG CCCTGCTGGC CTGTCTCCCT 
GCTGGCTAGC ATTCCTCAGG AAGGACATGG 
AGTCTGTAGG GAGTGAAGCC ACAGATTCTT 
TGTAGGTCAG G AC CATC AG A GAGGCAAGGT 
CCCTGTGGTC TGGGGAGAAA TCCCTCAGAT 
- 12- 



WO 99/31129 



PCT/US98/26364 



CCCACTAAAG TGTCAAGGTG TGGAAGGGAC CAAGCGACCA AGGATAGGCC 
ATCTGGGGTC TATGCCCACA TACCCACGTT TGTTCGCTTC CTGAGTCTTT 
TCATTGCTAC CTCTAATAGT CCTGTCTCCC ACTTCCCACT CGTTCCCCTC 
CTCTTCCGAG CTGCTTTGTG GGCTCCAGGC CTGTACTCAT CGGCAGGTGC 
5 ATGAGTATCT GTGGGAGTCC TCTAGAGAGA TGAGAAGCCA GGAGGCCTGC 
ACCAAATGTC AGAAGCTTGG CATGACCTCA TTCCGGCCAC ATCATTCTGT 
GTCTCTGCAT CCATTTGAAC ACATTATTAA GCACCGATAA TAGGTAGCCT 
GCTGTGGGGT ATACAGCATT GACTCAGATA TAGATCCTGA GCTCACAGAG 
TTTATAGTTA AAAAAACAAA CAGAAACACA AACAATTTGG ATCAAAAGGA 

10 GAAATGATAA GTGACAAAAG CAGCACAAGG AATTTCCCTG TGTGGATGCT 
GAGCTGTGAT GGCGGGCACT GGGTACCCAA GTGAAGGTTC CCGAGGACAT 
GAGTCTGTAG GAGCAAGGGC ACAAACTGCA GCTGTGAGTG CGTGTGTGTG 
ATTTGGTGTA GGTAGGTCTG TTTGCCACTT GATGGGGCCT GGGTTTGTTC 
CTGGGGCTGG AATGCTGGGT ATGCTCTGTG ACAAGGCTAC GCTGACAATC 

15 AGTTAAACAC ACCGGAGAAG AACCATTTAC ATGCACCTTA TATTTCTGTG 
TACACATCTA TTCTCAAAGC TAAAGGGTAT GAAAGTGCCT GCCTTGTTTA 
TAGCCACTTG TGAGTAAAAA TTTTTTTGCA TTTTCACAAA TTATACTTTA 
TATAAGGCAT TCCACACCTA AGAACTAGTT TTGGGAAATG TAGCCCTGGG 
TTTAATGTCA AATCAAGGCA AAAGGAATTA AATAATGTAC TTTTGGCTAG 

20 AGGGGTAAAC TTTTTTGGCC TTTTTCTGGG GAAAATAATG TGGGGGTGTG 
GGAATTCGAA TTCGATATCA AGCTTATCGA TACCGTCGAC CTCGAGGGGG 
GGCCCGGT AC CCAATTCGCC CTATAGTGAG TCGTATTACA ATT 
{ SEQ ID NO : 1 ) . 

The above-exemplified isolated DNA molecule, shown in 
25 Figure 1A-C and set forth as SEQ ID NO:l, contains an open reading 

frame from nucleotide 276 to nucleotide 1673, with a "TGA" termination 
codon from nucleotides 1674-1676. This open reading frame encodes a 
human nNR7 receptor, a 466 amino acid protein shown in Figures 2A-C 
and Figure 3 and as set forth in SEQ ID NO:2. 
30 Therefore,the present invention also relates to a 

substantially purified form of the novel nuclear trans-acting receptor 
protein, nNR7, shown in Figures 2A-C and Figure 3 and as set forth in 
SEQ ID NO: 2, disclosed as follows: 

SILCTGLFKV DPRGEVGAKN LPPSSPRGPE ANLEVRPKES WNHADFVHCE 
35 DTESVPGKPS VNADEEVGGP QICRVCGDKA TGYHFNVMTC EGCKGFFRRA 
MKRNARLRCP FRKGACEITR KTRRQCQACR LRKCLESGMK KEMIMSDEAV 
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EERRALIKRK KSERTGTQPL GVQGLTEEQR MMIRELMDAQ MKTFDTTFSH 
FKNFRLPGVL SSGCELPESL QAPSREEAAK WSQVRKDLCS LKVSLQLRGE 
DGSVWNYKPP ADSGG KEIFS LLPHMADMST YMFKGIISFA KVISYFRDLP 
IEDQISLLKG AAFELCQLRF NTVFNAETGT WECGRLSYCL EDTAGGFQQL 
5 LLEPMLKFHY MLKKLQLHEE EYVLMQAISL FSPDRPGVLQ HRWDQLQEQ 
FAITLKSYIE CNRPQPAHRF LFLKIMAMLT ELRSINAQHT QRLLRIQDIH 
PFATPLMQEL FGITGS ( SEQ ID NO: 2). 

The present invention also relates to biologically functional 
derivatives and/or mutants of nNR7 as set forth as SEQ ID NO: 2, 

10 including but not necessarily limited to amino acid substitutions, 

deletions, additions, amino terminal truncations and carboxy- terminal 
truncations such that these mutations provide for proteins or protein 
fragments of diagnostic, therapeutic or prophylactic use and would be 
useful for screening for agonists and/or antagonists of nNR7 function. 

15 Another exemplification of the present invention relates to an 

isolated DNA molecule which comprises the nucleotide sequence as 
disclosed in Figure 4A-C and as set forth in SEQ ID NO:17. The initial 
exemplified cDNA molecule disclosed in Figure 1A-C and set forth as 
SEQ ID NO:l does not contain a starting Met. Therefore, extension of 

20 the nNR7 5'-end sequence was continued. Three clones from 5'-end 
RACE (rapid amplifi cation of cDNA end) showed divergence from the 
original cDNA as set forth in SEQ ID NO:l. One of these clones was 
shown to contain an intitating Met with an inframe stop codon in front. 
This DNA molecule is set forth herein in Figure 4A-C and is set forth as 

25 SEQ ID NO:17. This DNA molecule encodes a complete reading frame 
for an nNR7 type nuclear receptor and is refered herein as nNR7-l, and 
is set forth in Figures 5A-C and Figure 6, as well as SEQ ID NO: 18. 

This portion of the present invention also relates to sent 
invention relates to isolated nucleic acid fragments of SEQ ID NO: 17 

30 which encode mRNA expressing a biologically active novel human 

nuclear receptor. Any such nucleic acid fragment will encode either a 
protein or protein fragment comprising at least an intracellular DNA- 
binding domain and/or ligand binding domain, domains conserved 
throughout the human nuclear receptor family domain which exist in 

35 nNR7-l (SEQ ID NO: 18). Any such polynucleotide includes but is not 
necessarily limited to nucleotide substitutions, deletions, additions, 
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amino- terminal truncations and carboxy- terminal truncations such 
that these mutations encode mRNA which express a protein or protein 
fragment of diagnostic, therapeutic or prophylactic use and would be 
useful for screening for agonists and/or antagonists for nNR7-l 
5 function. 

The isolated nucleic acid molecule of this portion of the 
invention may include a deoxyribonucleic acid molecule (DNA), such as 
genomic DNA and complementary DNA (cDNA), which may be single 
(coding or noncoding strand) or double stranded, as well as synthetic 

10 DNA, such as a synthesized, single stranded polynucleotide. The 

isolated nucleic acid molecule of the present invention may also include 
a ribonucleic acid molecule (RNA). 

The present invention also relates to recombinant vectors 
and recombinant hosts, both prokaryotic and eukaryotic, which contain 

15 the substantially purified nucleic acid molecules disclosed throughout 
this specification. 

An especially preferred aspect of the present invention is 
disclosed in Figure 4A-C and SEQ ID NO: 17, a human cDNA encoding 
a novel nuclear trans-acting receptor protein, nNR7-l, disclosed as 

20 follows: 

TCCATCCTAA TACGACTCAC TATAGGGCTC GAGCGGCCGC CCGGGCKGGT 
CTTTTGGCCT GCTGGGTTAG TGCTGGCAGC CCCCTGAGGC CAAGGACAGC 
AGCATGACAG TCACCAGGAC TCACCACTTC AAGGAGGGGT CCCTCAGAGC 
ACCTGCCATA CCCCTGCACA GTGCTGCGGC TGAGTTGGCT TCAAACCATC 

25 CAAGAGGCCC AGAAGCAAAC CTGGAGGTGA GACCCAAAGA AAGCTGGAAC 
CATGCTGACT TTGTACACTG TGAGGACACA GAGTCTGTTC CTGGAAAGCC 
CAGTGTCAAC GCAGATGAGG AAGTCGGAGG TCCCCAAATC TGCCGTGTAT 
GTGGGGACAA GGCCACTGGC TATCACTTCA ATGTCATGAC ATGTGAAGGA 
TGCAAGGGCT TTTTCAGGAG GGCCATGAAA CGCAACGCCC GGCTGAGGTG 

30 CCCCTTCCGG AAGGGCGCCT GCGAGATCAC CCGGAAGACC CGGCGACAGT 
GCCAGGCCTG CCGCCTGCGC AAGTGCCTGG AGAGCGGCAT GAAGAAGGAG 
ATGATCATGT CCGACGAGGC CGTGGAGGAG AGGCGGGCCT TGATCAAGCG 
GAAGAAAAGT GAACGGACAG GGACTCAGCC ACTGGGAGTG CAGGGGCTGA 
CAGAGGAGCA GCGGATGATG ATCAGGGAGC TGATGGACGC TCAGATGAAA 

35 ACCTTTGACA CTACCTTCTC CCATTTCAAG AATTTCCGGC TGCCAGGGGT 
GCTTAGCAGT GGCTGCGAGT TGCCAGAGTC TCTGCAGGCC CCATCGAGGG 
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AAGAAGCTGC CAAGTGGAGC 
GTCTCTCTGC AGCTGCGGGG 
CCCAGCCGAC AGTGGCGGGA 
CTGACATGTC AACCTACATG 
5 ATCTCCTACT TCAGGGACTT 
GGGGGCCGCT TTCGAGCTGT 
CGGAGACTGG AACCTGGGAG 
ACTGCAGGTG GCTTCCAGCA 
CTACATGCTG AAGAAGCTGC 

10 AGGCCATCTC CCTCTTCTCC 
GTGGTGGACC AGCTGCAGGA 
TGAATGCAAT CGGCCCCAGC 
TGGCTATGCT CACCGAGCTC 
CTGCTGCGCA TCCAGGACAT 

15 GTTGTTCGGC ATCACAGGTA 
GAGAGGCAGC CAGACCCAGA 
CAGATGGACA CTGCCAAGAG 
GGGAATTCCT GCTATGACAG 
TGCCCCCCAC CCCCAGTTCA 

20 CGTGGAGAGT GCACTGACCT 
GCCCTTTCCT TTTAAAAGGC 
CCACTAAAGT GTCAAGGTGT 
TCTGGGGTCT ATGCCCACAT 
CATTGCTACC TCTAATAGTC 

25 TCTTCCGAGC TGCTTTGTGG 
TGAGTATCTG TGGGAGTCCT 
CCAAATGTCA GAAGCTTGGC 
TCTCTGCATC CATTTGAACA 
CTGTGGGGTA TACAGCATTG 

30 TTATAGTTAA AAAAACAAAC 
AAATGATAAG TGACAAAAGC 
AGCTGTGATG GCGGGCACTG 
AGTCTGTAGG AGCAAGGGCA 
TTTGGTGTAG GTAGGTCTGT 

35 TGGGGCTGGA ATGCTGGGTA 
GTTAAACACA CCGGAGAAGA 



CAGGTCCGGA AAGATCTGTG CTCTTTGAAG 
GGAGGATGGC AGTGTCTGGA AC TAC AAACC 
AAGAGATCTT CTCCCTGCTG CCCCACATGG 
TTCAAAGGCA TCATCAGCTT TGCCAAAGTC 
GCCCATCGAG GACCAGATCT CCCTGCTGAA 
GTCAACTGAG ATTCAACACA GTGTTCAACG 
TGTGGCCGGC TGTCCTACTG CTTGGAAGAC 
ACTTCTACTG GAGCCCATGC TGAAATTCCA 
AGCTGCATGA GGAGGAGTAT GTGCTGATGC 
CCAGACCGCC CAGGTGTGCT GCAGCACCGC 
GCAATTCGCC ATTACTCTGA AGTCCTACAT 
CTGCTCATAG GTTCTTGTTC CTGAAGATCA 
CGCAGCATCA ATGCTCAGCA CACCCAGCGG 
ACACCCCTTT GCTACGCCCC TCATGCAGGA 
GCTGAGCGGC TGCCCTTGGG TGACACCTCC 
GCCCTCTGAG CCGCCACTCC CGGGCCAAGA 
CCGACAATGC CCTGCTGGCC TGTCTCCCTA 
CTGGCTAGCA TTCCTCAGGA AGGACATGGG 
GTCTGTAGGG AGTGAAGCCA CAGATTCTTA 
GTAGGTCAGG ACCATCAGAG AGGCAAGGTT 
CCTGTGGTCT GGGGAGAAAT CCCTCAGATC 
GGAAGGGACC AAGCGACCAA GGATAGGCCA 
ACCCACGTTT GTTCGCTTCC TGAGTCTTTT 
CTGTCTCCCA CTTCCCACTC GTTCCCCTCC 
GCTCCAGGCC TGTACTCATC GGCAGGTGCA 
C TAG AG AG AT GAGAAGCCAG GAGGCCTGCA 
ATGACCTCAT TCCGGCCACA TCATTCTGTG 
CATTATTAAG CACCGATAAT AGGTAGCCTG 
ACTCAGATAT AGATCCTGAG CTCACAGAGT 
AGAAACACAA ACAATTTGGA TCAAAAGGAG 
AGCACAAGGA ATTTCCCTGT GTGGATGCTG 
GGTACCCAAG TGAAGGTTCC CGAGGACATG 
CAAACTGCAG CTGTGAGTGC GTGTGTGTGA 
TTGCCACTTG ATGGGGCCTG GGTTTGTTCC 
TGCTCTGTGA CAAGGCTACG CTGACAATCA 
ACCATTTACA TGCACCTTAT ATTTCTGTGT 
- 16- 



WO 99/31129 



PCT/US98/26364 



ACACATCTAT TCTCAAAGCT AAAGGGTATG AAAGTGCCTG CCTTGTTTAT 
AGCCACTTGT GAGTAAAAAT TTTTTTGCAT TTTCACAAAT TATACTTTAT 
ATAAGGCATT CCACACCTAA GAACTAGTTT TGGGAAATGT AGCCCTGGGT 
TTAATGTCAA ATCAAGGCAA AAGGAATTAA ATAATGTACT TTTGGCTAGA 
5 GGGGTAAACT TTTTTGGCCT TTTTCTG GGG AAAATAATGT GGGGGTGTGG 
{SEQ ID NO: 17) . 

The above-exemplified isolated DNA molecule, shown in 
Figure 4A-C and set forth as SEQ ID NO: 17, contains an open reading 
frame from nucleotide 104 to nucleotide 1522, with a "TGA" termination 

10 codon from nucleotides 1523-1525. This open reading frame encodes a 
human nNR7-l receptor, a 473 amino acid protein shown in 
Figures 5A-C and Figure 6 and as set forth in SEQ ID NO: 18. 

Therefore, the present invention also relates to a 
substantially purified form of the novel nuclear trans-acting receptor 

15 protein, nNR7-l, which is shown in Figures 5A-C and Figure 6 and as 
set forth in SEQ ID NO: 18, disclosed as follows: 

MTVTRTHHFK EGSLRAPAIP LHSAAAELAS NHPRGPEANL EVRPKESWNH 
ADFVHCEDTE SVPGKPSVNA DEEVGGPQIC RVCGDKATGY HFNVMTCEGC 
KGFFRRAMKR NARLRCPFRK GACEITRKTR RQCQACRLRK CLESGMKKEM 

20 IMSDEAVEER RALIKRKKSE RTGTQPLGVQ GLTEEQRMMI RELMDAQMKT 
FDTTFSHFKN FRLPGVLSSG CELPESLQAP SREEAAKWSQ VRKDLCSLKV 
SLQLRGEDGS VWNYKPPADS GGKEIFSLLP HMADMSTYMF KGIISFAKVI 
SYFRDLPIED QISLLKGAAF ELCQLRFNTV FNAETGTWEC GRLSYCLEDT 
AGGFQQLLLE PMLKFHYMLK KLQLHEEEYV LMQAISLFSP DRPGVLQHRV 

25 VDQLQEQFAI TLKSYIECNR PQPAHRFLFL KIMAMLTELR SINAQHTQRL 
LRIQDIHPFA TPLMQELFGI TGS (SEQ ID NO: 18) . 

A comparison of the nucleotide sequences and related open 
reading frames of SEQ ID NO:l and SEQ ID NO: 17 which encode nNR7 
(SEQ ID NO:2) and nNR7-l (SEQ ID NO: 18) reveals nucleotide sequence 

30 divergence at the 5 1 end of these cDNA clones which results in different 
amino terminal regions for nNR7-l compared to nNR7. More 
specifically, an open reading frame from nucleotide 276 to nucleotide 350 
of SEQ ID NO:l encodes the NH2 terminal 25 amino acids of nNR7, as 
set forth in Figure 3 and SEQ ID NO:2. There is no initiating 

35 methionine residue in nNR7. In constrast, an initiating codon for 

methionine is present in SEQ ID NO: 17 (nucleotide residue 104-106) and 
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the 5' portion of the coding region of this cDNA clone differs from 
nucleotide 104 through nucleotide 198 of SEQ ID NO:17, which encodes 
the NH2 terminal 32 amino acids of nNR7-l (SEQ ID NO:18) that differ 
from the initial 25 amino acid residues of nNR7. In other words, nNR7 
5 and nNR7-l are identical from amino acid 26-466 of nNR7 and amino 
acid 33-473 ofnNR7-l. 

The present invention also relates to biologically functional 
derivatives and/or mutants of nNR7 as set forth as SEQ ID NO: 18, 
including but not necessarily limited to amino acid substitutions, 

10 deletions, additions, amino terminal truncations and carboxy-terminal 
truncations such that these mutations provide for proteins or protein 
fragments of diagnostic, therapeutic or prophylactic use and would be 
useful for screening for agonists and/or antagonists of nNK7-l function. 
As with many proteins, it is possible to modify many of the amino acids 

15 of nNR7 or nNR7-l, particularly those which are not found in the DNA 
binding domain, and still retain substantially the same biological 
activity as the original receptor. Thus this invention includes modified 
nNR7 and nNR7-l polypeptides which have amino acid deletions, 
additions, or substitutions but that still retain substantially the same 

20 biological activity as nNR7 and nNR7-l. It is generally accepted that 

single amino acid substitutions do not usually alter the biological activity 
of a protein (see, e.g., Molecular Biology of the Gene. Watson et al. t 1987, 
Fourth Ed., The Benjamin/Cnmmings Publishing Co., Inc., page 226; 
and Cunningham & Wells, 1989, Science 244:1081-1085). Accordingly, 

25 the present invention includes polypeptides where one amino acid 

substitution has been made in SEQiD.NO.:2 or SEQ ID NO: 18 wherein 
the polypeptides still retain substantially the same biological activity as 
nNR7 or nNR7-l. The present invention also includes polypeptides 
where two or more amino acid substitutions have been made in 

30 SEQ.ID.NO.:2 or SEQ ID NO:18 wherein the polypeptides still retain 
substantially the same biological activity as the wild type protein. In 
particular, the present invention includes embodiments where the 
above-described substitutions are conservative substitutions. In 
particular, the present invention includes embodiments where the 

35 above-described substitutions do not occur in the DNA-binding domain of 
nNR7 or nNR7-l. When deciding which amino acid residues of nNR7 or 
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nNR7-l may be substituted to produce polypeptides that are functional 
equivalents of nNR7 or nNR7-l, one skilled in the art would be guided by 
a comparison of the amino acid sequence of nNR7 or nNR7- 1 with the 
amino acid sequences of related proteins, One skilled in the art would 
5 also recognize that polypeptides that are functional equivalents of nNR7 
or nNR7-l and have changes from the aruino acid sequence of these 
respective proteins that are small deletions or insertions of amino acids 
could also be produced by following the same guidelines, (i.e, 
minimizing the differences in amino acid sequence between nNR7 or 

10 nNR7-l and related proteins. Small deletions or insertions are generally 
in the range of about 1 to 5 amino acids. The effect of such small 
deletions or insertions on the biological activity of the modified nNR7 or 
nNR7-l polypeptide can easily be assayed by producing the polypeptide 
synthetically or by making the required changes in DNA encoding nNR7 

15 or nNR7-l and then expressing the DNA recombinantly and assaying 
the protein produced by such recombinant expression. 

The present invention also relates to isolated nucleic acid 
molecules which are fusion constructions expressing fusion proteins 
useful in assays to identify compounds which modulate wild-type 

20 human nNR7 and/or nNR7-l activity. A preferred aspect of this portion 
of the invention includes, but is not limited to, glutathione S-transferase 
GST-nNR7 and/or GST-nNR7-l fusion constructs. These fusion 
constructs include, but are not limited to, all or a portion of the ligand- 
binding domain of nNR7 and/or nNR7-l, respectively, as an in-frame 

25 fusion at the carboxy terminus of the GST gene. The disclosure of SEQ 
ID Nos:l, 2, 17 and 18 allow the artisan of ordinary skill to construct any 
such nucleic acid molecule encoding a GST- nuclear receptor fusion 
protein. Soluble recombinant GST-nuclear receptor fusion proteins may 
be expressed in various expression systems, including Spodoptera 

30 frugiperda (Sf21) insect cells (Invitrogen) using a baculovirus 

expression vector (e.g., Bac-N-Blue DNA from Invitrogen or pAcG2T 
from Pharmingen). 

It is also within the purview of the artisan of ordinary skill 
to use the DNA molecules of the present invention to construct DNA 

35 expression vectors for use in transactivation assays to identify 

modulators of nNH7, nNR7-l and/or DNA molecules or proteins which 
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interact with nNR7-like proteins. For example, a portion of SEQ ID 
NO:l or SEQ ID NO: 17 may be fused downstream of an active promoter, 
such as a CMV promoter fragment, so as to overexpress a receptor 
protein. A second construct includes regulatory regions from the 
5 human CYP3A4 gene (such as the human CYP3A4 promoter region or 
a response element [e.g., SEQ ID NO:24]) fused to a reporter gene, such 
as SEAP (secreted placental alkaline phosphatase, LacZ or 
chloremphenicol acetly transferase [CAT]). These constructs may then 
be utilized in a transactivation assay to identify modulators of CYP3A4 

10 levels in vivo. It will be known that various modifications to either or 
both the receptor construct and/or the reporter construct may be made 
without effecting the effectiveness of the transactivation assay. For 
example, the native nNR7 or nNR7-l promoter may be utilized instead of 
a hybrid promoter such as the CMV promoter. Or it may be useful to 

15 use a different reporter gene, such as LacZ. 

Therefore, the present invention is also directed to methods 
for screening for compounds which modulate the expression of DNA or 
RNA encoding a human nNR7 and/or nNR7-l protein. Compounds 
which modulate these activities may be DNA, RNA, peptides, proteins, 

20 or non-pro teinaceous organic molecules. Compounds may modulate by 
increasing or attenuating the expression of DNA or RNA encoding 
human nNR7 and/or nNR7-l, or the function of human nNR7 and/or 
nNR7-l. Compounds that modulate the expression of DNA or RNA 
encoding human nNR7 and/or nNR7-l or the biological function thereof 

25 may be detected by a variety of assays. The assay may be a simple 

"yes/no" assay to determine whether there is a change in expression or 
function. The assay may be made quantitative by comparing the 
expression or function of a test sample with the levels of expression or 
function in a standard sample. Kits containing human nNR7 and/or 

30 nNR7-l, antibodies to human nNR7 and/or nNR7-l, or modified human 
nNR7 and/or nNR7-l may be prepared by known methods for such uses. 

The DNA molecules, RNA molecules, recombinant protein 
and antibodies of the present invention may be used to screen and 
measure levels of human nNR7 and/or nNR7-l. The recombinant 

35 proteins, DNA molecules, RNA molecules and antibodies lend 

themselves to the formulation of kits suitable for the detection and typing 
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of human nNR7 and/or nNR7-L Such a kit would comprise a 
compartmentalized carrier suitable to hold in close confinement at least 
one container. The carrier would further comprise reagents such as 
recombinant nNR7 and/or nNR7-l or anti-nNR7 and/or nNR7-l 
5 antibodies suitable for detecting human nNR7 and/or nNR7-l. The 
carrier may also contain a means for detection such as labeled antigen 
or enzyme substrates or the like. 

Pharmaceutically useful compositions comprising 
modulators of human nNR7 and/or nNR7-l may be formulated 

10 according to known methods such as by the admixture of a 

pharmaceutically acceptable carrier. Examples of such carriers and 
methods of formulation may be found in Remington's Pharmaceutical 
Sciences. To form a pharmaceutically acceptable composition suitable 
for effective administration, such compositions will contain an effective 

15 amount of the protein, DNA, RNA, modified human nNR7 and/or 
nNR7-l, or either nNR7 and/or nNR7-l agonists or antagonists. 

Therefore, the present invention includes assays by which 
modulators of cytochrome P450 enzymes, especially CYP 450 enzymes 
which comprise the response element 5'-TGAACTCAAAGGAGGTCA- 

20 3' (SEQ ID NO:24), and especially human CYP3A4. Methods for 

identifying agonists and antagonists of other receptors are well known 
in the art and can be adapted to identify compounds which effect in vivo 
levels of CYP3A4. Accordingly, the present invention includes a method 
for determining whether a substance is a potential modulator of 

25 CYP3A4 levels that comprises: 

(a) transfecting or transforming cells with an 
expression vector encoding nNR7 or nNR7-l, also known as the receptor 
vector; 

(b) transfecting or transforming the cells of step (a) with 
30 second expression vector which comprises a response element known to 

bind nNR7 or nNR7-l and a promoter fragment fused upstream of a 
reporter gene, also known as a reporter vector. 

(c) allowing the transfected cells to grow for a time 
sufficient to allow nNR7 or nNK7-l to be expressed; 

35 (d) exposing the test cells to a substance while not 

exposing control cells to the test substance; 
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(e) measuring the expression of the reporter gene in both 
the test cells and control cells; where if the amount of binding of 
expression in the test ceils is greater than in the control cells, the 
substance may enhance or act as an agonist to CYP3A4 activity, 
5 whereas the opposite effect may suggest a possible antagonist of CYP3A4 
activity. 

The conditions under which step (d) of the method is 
practiced are conditions that are typically used in the art for the study of 
protein-ligand interactions: e.g., physiological pH; salt conditions such 
10 as those represented by such commonly used buffers as PBS or in tissue 
culture media; a temperature of about 4°C to about 55°C. 

Alternatively, the transactivation assay may be carried out 
as follows: (a) provide test cells by transfecting cells with a receptor 
expression vector that directs the expression of nNR7 or nNR7-l in the 
15 cells; 

(b) providing test cells by transfecting the cells of step (a) 
with a second reporter expression vector that directs expression of a 
reporter gene under control of a regulatory element which is responsive 
to nNR7 or nNR7-l 

20 (b) exposing the test cells to the substance; 

(c) measuring the amount of binding of expression of the 
reporter gene; 

(d) comparing the amount of expression of the reporter 
gene in the test cells with the amount of expression of the reporter gene 

25 in control cells that has been transfected with a reporter vector of step (b) 
but not a receptor vector of step (a). 

Therefore, it is evident that any number of variations known to one of 
skill in the art may be utilized in order to provide for an assay to 
measure the effect of a substance on the ability of the nuclear receptor 

30 proteins of the present invention to effect transcription of a promoter of 
interest (e.g., effecting CYP3A4 gene expression). 

The present invention also includes a method for 
determining whether a substance is capable of binding to nNR7 and/or 
nNR7-l, i.e., whether the substance is a potential agonist or an 

35 antagonist of nNR7 and/or nNR7-l, where the method comprises: 
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(a) providing test cells by transfecting cells with an 
expression vector that directs the expression of nNR7 and/or nNR7-l in 
the cells; 

(b) exposing the test cells to the substance; 

5 (c) measuring the amount of binding of the substance to 

nNR7 and/or r±NR7-l; 

(d) comparing the amount of binding of the substance to 
nNR7 or nNR7-l in the test cells with the amount of binding of the 
substance to control cells that have not been transfected with nNR7 or 

0 nNR7-l; wherein if the amount of binding of the substance is greater in 
the test cells as compared to the control cells, the substance is capable of 
binding to nNR7 or nNR7-L Determining whether the substance is 
actually an agonist or antagonist can then be accomplished by the use of 
functional assays such as the transactivation assay as described above. 

5 The conditions under which step (b) of the method is 

practiced are conditions that are typically used in the art for the study of 
protein-ligand interactions: e.g., physiological pH; salt conditions such 
as those represented by such commonly used buffers as PBS or in tissue 
culture media; a temperature of about 4°C to about 55°C. 

0 The assays described above can be carried out with cells 

that have been transiently or stably transfected with nNR7 or nNR7-l. 
Transfection is meant to include any method known in the art for 
introducing nNR7 or nNR7-l into the test cells. For example, 
transfection includes calcium phosphate or calcium chloride mediated 

5 transfection, lipofection, infection with a retroviral construct containing 
nNR7 or nNR7-l, and electroporation. 

Where binding of the substance or agonist to nNR7 or 
nNR7-l is measured, such binding can be measured by employing a 
labeled substance or agonist. The substance or agonist can be labeled in 

0 any convenient manner known to the art, e.g., radioactively, 
fluorescently, enzymatically. 

In particular embodiments of the above-described methods, 
nNR7 or nNR7-l has an amino acid sequence of SEQ ID NO:2. 

The isolated nucleic acid molecule of the present invention 

5 may include a deoxyribonucleic acid molecule (DNA), such as genomic 
DNA and complementary DNA (cDNA), which may be single (coding or 
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noncoding strand) or double stranded, as well as synthetic DNA, such 
as a synthesized, single stranded polynucleotide. The isolated nucleic 
acid molecule of the present invention may also include a ribonucleic 
acid molecule (RNA), 
5 It is known that there is a substantial amount of 

redundancy in the various codons which code for specific amino 
acids. Therefore, this invention is also directed to those DNA 
sequences encode RNA comprising alternative codons which code for 
the eventual translation of the identical amino acid, as shown below: 

10 A=Ala=Alanine: codons GCA, GCC, GCG, GCU 
C=Cys=Cysteine: codons UGC, UGU 
D=Asp=Aspartic acid: codons GAC, GAU 
E=Glu=Glutamic acid: codons GAA, GAG 
F=Phe=Phenylalanine: codons UUC, UUU 

15 G=Gly=Glycine: codons GGA, GGC, GGG, GGU 
H=His =Histidine: codons CAC, CAU 
I=Ile =Isoleucine: codons AUA, AUC, AUU 
K=Lys=Lysine: codons AAA, AAG 

L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 
20 M=Met=Methionine: codon AUG 

N=Asp=Asparagine: codons AAC, AAU 
P=Pro=Proline: codons CCA, CCC, CCG, CCU 
Q=Gln=Glutamine: codons CAA, CAG 

R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU 
25 S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU 

T=Thr=Threonine: codons ACA, ACC, ACG, ACU 

V=Val=Valine: codons GUA, GUC, GUG, GUU 

W=Trp=Tryptophan: codon UGG 

Y=Tyr=Tyrosine: codons UAC, UAU 
30 Therefore, the present invention discloses codon redundancy which 

may result in differing DNA molecules expressing an identical 

protein. For purposes of this specification, a sequence bearing one or 

more replaced codons will be defined as a degenerate variation. Also 

included within the scope of this invention are mutations either in 
35 the DNA sequence or the translated protein which do not 

substantially alter the ultimate physical properties of the expressed 
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protein. For example, substitution of valine for leucine, arginine for 
lysine, or asparagine for glutamine may not cause a change in 
functionality of the polypeptide. 

It is known that DNA sequences coding for a peptide 
5 may be altered so as to code for a peptide having properties that are 
different than those of the naturally occurring peptide. Methods of 
altering the DNA sequences include but are not limited to site 
directed mutagenesis. Examples of altered properties include but are 
not limited to changes in the affinity of an enzyme for a substrate or a 
10 receptor for a ligand. 

As used herein, "purified" and "isolated" are utilized 
interchangeably to stand for the proposition that the nucleic acid, 
protein, or respective fragment thereof in question has been 
substantially removed from its in vivo environment so that it may be 

15 manipulated by the skilled artisan, such as but not limited to nucleotide 
sequencing, restriction digestion, site-directed mutagenesis, and 
subcloning into expression vectors for a nucleic acid fragment as well as 
obtaining the protein or protein fragment in pure quantities so as to 
afford the opportunity to generate polyclonal antibodies, monoclonal 

20 antibodies, amino acid sequencing, and peptide digestion. Therefore, 
the nucleic acids claimed herein may be present in whole cells or in cell 
lysates or in a partially purified or substantially purified form. A 
nucleic acid is considered substantially purified when it is purified away 
from environmental contaminants. Thus, a nucleic acid sequence 

25 isolated from cells is considered to be substantially purified when 
purified from cellular components by standard methods while a 
chemically synthesized nucleic acid sequence is considered to be 
substantially purified when purified from its chemical precursors. 

The present invention also relates to recombinant vectors 

30 and recombinant hosts, both prokaryotic and eukaryotic, which contain 
the substantially purified nucleic acid molecules disclosed throughout 
this specification. 

Therefore, the present invention also relates to methods of 
expressing nNR7 and/or nNR7-l and biological equivalents disclosed 

35 herein, assays employing these recombinantly expressed gene products, 
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cells expressing these gene products, and agonistic and/or antagonistic 
compounds identified through the use of assays utilizing these 
recombinant forms, including, but not limited to, one or more 
modulators of the human nNR7 and/or nNR7-l either through direct 
5 contact LBD or through direct or indirect contact with a ligand which 
either interacts with the DBD or with the wild-type transcription 
complex which nNR7 and/or nNR7-l interacts in trans, thereby 
modulating cell differentiation or cell development. 

As used herein, a biologically functional derivative* of a 

10 wild-type human nNR7 and/or nNR7-l possesses a biological activity 
that is related to the biological activity of the wild type human nNR7 
and/or nNR7-l. The term "functional derivative" is intended to include 
the "fragments," "mutants," "variants," "degenerate variants," 
"analogs" and "homologues" of the wild type human nNR7 and/or nNR-1 

15 protein. The term "fragment" is meant to refer to any polypeptide subset 
of wild-type human nNR7 and/or nNR7-l, including but not necessarily 
limited to amino acid substitutions, deletions, additions, amino terminal 
truncations and carboxy- terminal truncations. The term "mutant" is 
meant to refer a subset of a biologically active fragment that may be 

20 substantially similar to the wild-type form but possesses distinguishing 
biological characteristics. Such altered characteristics include but are 
in no way limited to altered substrate binding, altered substrate affinity 
and altered sensitivity to chemical compounds affecting biological 
activity of the human nNR7 and/or nNR7-l or human nNR7 and/or 

25 nNR7-l functional derivative. The term "variant" is meant to refer to a 
molecule substantially similar in structure and function to either the 
entire wild-type protein or to a fragment thereof. A molecule is 
"substantially similar" to a wild- type human nNE7 and/or nNE7-l-like 
protein if both molecules have substantially similar structures or if both 

30 molecules possess similar biological activity. Therefore, if the two 

molecules possess substantially similar activity, they are considered to 
be variants even if the structure of one of the molecules is not found in 
the other or even if the two amino acid sequences are not identical. The 
term "analog" refers to a molecule substantially similar in function to 

35 either the full-length human nNR7 and/or nNR7-l protein or to a 
biologically functional derivative thereof. 
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Any of a variety of procedures may be used to clone human 
nNR7 and/or nNR7-l. These methods include, but are not limited to, (1) 
a RACE PGR cloning technique (Frohman, et al., 1988, Proc. Natl. 
Acad. Sci. USA 85: 8998-9002). 5c and/or 3tf RACE may be performed to 
5 generate a full-length cDNA sequence. This strategy involves using 
gene-specific oligonucleotide primers for PCR amplification of human 
nNR7 and/or nNR7-l cDNA. These gene-specific primers are designed 
through identification of an expressed sequence tag (EST) nucleotide 
sequence which has been identified by searching any number of publicly 

10 available nucleic acid and protein databases; (2) direct functional 
expression of the human nNR7 and/or nNR7-l cDNA following the 
construction of a human nNR7 and/or nNR7-l- containing cDNA library 
in an appropriate expression vector system; (3) screening a human 
nNR7 and/or nNR7-l-containing cDNA library constructed in a 

15 bacteriophage or plasmid shuttle vector with a labeled degenerate 
oligonucleotide probe designed from the amino acid sequence of the 
human nNR7 and/or nNR7-l protein; (4) screening a human nNR7 
and/or nNR7-l-containing cDNA library constructed in a bacteriophage 
or plasmid shuttle vector with a partial cDNA encoding the human 

20 nNR7 and/or nNR7-l protein. This partial cDNA is obtained by the 
specific PCR amplification of human nNR7 and/or nNR7-l DNA 
fragments through the design of degenerate oligonucleotide primers 
from the amino acid sequence known for other kinases which are 
related to the human nNR7 and/or nNR7-l protein; (5) screening a 

25 human nNR7 and/or nNR7-l-containing cDNA library constructed in a 
bacteriophage or plasmid shuttle vector with a partial cDNA encoding 
the human nNR7 and/or nNR7-l protein. This strategy may also involve 
using gene-specific oligonucleotide primers for PCR amplification of 
human nNR7 and/or nNR7-l cDNA identified as an EST as described 

30 above; or (6) designing 5c and 3^ gene specific oligonucleotides using 
SEQ ID NO: 1 as a template so that either the full-length cDNA may be 
generated by known PCR techniques, or a portion of the coding region 
may be generated by these same known PCR techniques to generate and 
isolate a portion of the coding region to use as a probe to screen one of 

35 numerous types of cDNA and/or genomic libraries in order to isolate a 
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full-length version of the nucleotide sequence encoding human nNR7 
and/or nNR7-l. 

It is readily apparent to those skilled in the art that other 
types of libraries, as well as libraries constructed from other cell types-or 

5 species types, may be useful for isolating a nNR7 and/or nNR7-l- 
encoding DNA or a nNR7 and/or nNR7-l homologue. Other types of 
libraries include, but are not limited to, cDNA libraries derived from 
other cells or cell lines other than human cells or tissue such as murine 
cells, rodent cells or any other such vertebrate host which may contain 

0 nNR7 and/or nNR7- 1-encoding DNA. Additionally a nNR7 and/or 
nNR7-l gene and homologues may be isolated by oligonucleotide- or 
polynucleotide-based hybridization screening of a vertebrate genomic 
library, including but not limited to, a murine genomic library, a rodent 
genomic library, as well as concomitant human genomic DNA 

5 libraries. 

It is readily apparent to those skilled in the art that suitable 
cDNA libraries may be prepared from cells or cell lines which have 
nNR7 and/or nNR7-l activity. The selection of cells or cell lines for use 
in preparing a cDNA library to isolate a cDNA encoding nNR7 and/or 

0 nNR7-l may be done by first measuring cell-associated nNR7 and/or 
nNR7-l activity using any known assay available for such a purpose. 

Preparation of cDNA libraries can be performed by 
standard techniques well known in the art. Well known cDNA library 
construction techniques can be found for example, in Sambrook et al., 

5 1989, Molecular Cloning: A Laboratory Manual; Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York. Complementary DNA 
libraries may also be obtained from numerous commercial sources, 
including but not limited to Clontech Laboratories, Inc. and Stratagene. 

It is also readily apparent to those skilled in the art that 

0 DNA encoding human nNK7 and/or nNR7-l may also be isolated from a 
suitable genomic DNA library. Construction of genomic DNA libraries 
can be performed by standard techniques well known in the art. Well 
known genomic DNA library construction techniques can be found in 
Sambrook, et al., supra. 

5 In order to clone the human nNR7 and/or nNR7-l gene by 

one of the preferred methods, the amino acid sequence or DNA sequence 
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of human nNR7 and/or nNR7-l or a homologous protein may be 
necessary. To accomplish this, the nNR7 and/or nNR7-l protein or a 
homologous protein may be purified and partial amino acid sequence 
determined by automated sequenators. It is not necessary to determine 
5 the entire amino acid sequence, but the linear sequence of two regions of 
6 to 8 amino acids can be determined for the PCR amplification of a 
partial human nNR7 and/or nNR7-l DNA fragment. Once suitable 
amino acid sequences have been identified, the DNA sequences capable 
of encoding them are synthesized. Because the genetic code is 

10 degenerate, more than one codon may be used to encode a particular 
amino acid, and therefore, the amino acid sequence can be encoded by 
any of a set of similar DNA oligonucleotides. Only one member of the set 
will be identical to the human nNR7 and/or nNR7-l sequence but others 
in the set will be capable of hybridizing to human nNR7 and/or nNR7-l 

15 DNA even in the presence of DNA oligonucleotides with mismatches. 
The mismatched DNA oligonucleotides may still sufficiently hybridize to 
the human nNR7 and/or nNR7-l DNA to permit identification and 
isolation of human nNR7 and/or nNR7-l encoding DNA. Alternatively, 
the nucleotide sequence of a region of an expressed sequence may be 

20 identified by searching one or more available genomic databases. Gene- 
specific primers may be used to perform PCR amplification of a cDNA of 
interest from either a cDNA library or a population of cDNAs. As noted 
above, the appropriate nucleotide sequence for use in a PCR-based 
method may be obtained from SEQ ID NO: 1, either for the purpose of 

25 isolating overlapping 5c and 3c RACE products for generation of a full- 
length sequence coding for human nNR7 and/or nNR7-l, or to isolate a 
portion of the nucleotide sequence coding for human nNR7 and/or 
nNR7-l for use as a probe to screen one or more cDNA- or genomic- 
based libraries to isolate a full-length sequence encoding human nNR7 

30 and/or nNR7-l-like proteins. 

In one exemplified method, the human nNR7 full-length 
cDNA of the present invention were generated by screening a mixed 
cDNA library with DNA fragments corresponding to human AR, ERb, 
GR and VDR DNA binding domain regions. Positive clones were 

35 excised from 1ZAPII phages and the plasmid DNA was sequenced 
directly M13 forward and reverse primers. A Blast search of the 
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sequence information was obtained using computer program 
PhredPhrap. A single cDNA clone was shown to comprise the open 
reading frame of a novel human nuclear receptor. DNA sequencing 
was continued from both 5' and 3' ends on nNR7, which identified the 
5 presence of an intron, A cDNA fragment flanking the intron region was 
retrieved via PGR using a first oligonucleotide primer primer pair and 
double nested primers. A prominent DNA fragment of -1.5 kb was 
amplified from testis and brain cDNAs. The fragment was purified and 
subcloned into pCRII vector (Invitrogen, San Diego, CA). Automated 

10 sequencing was performed on several of the clones. The complete 
sequencing revealed a 3.1 kb cDNA which codes an authentic novel 
nuclear receptor, nNR7. 

In a second exemplified method oligo primers were used to 
amplify nNR7 5' cDNA ends from human liver cDNA. DNA product 

15 from one round PCR reaction was gel purified, subcloned into a plasmid 
vector and subjected to nucleotide sequencing. Ambiguities and/or 
discrepancies between automated base calling in sequencing reads were 
visually examined and edited to the correct base call. One clone was 
found to have an open reading frame differing from the open reading 

20 frame disclosed for SEQ ID NO:l, such that this additional exemplifed 
cDNA (as disclosed in SEQ ID NO: 17) contains an in frame Met and up- 
stream stop codons. The site of divergence (as discussed herein) is the 
same as the other two additional clones derived from 5 '-RACE (rapid 
amplification of cDNA ends). The cDNA clone disclosed in Figure 4A- 

25 4C and set forth as SEQ ID NO:17 is a complementary DNA sequence to 
a completely processed mRNA which expresses an especially preferred 
nuclear receptor protein of the present invention (i.e, nNR7-l). 

A variety of mammalian expression vectors may be used to 
express recombinant human nNR7 and/or nNR7-l in mammalian cells. 

30 Expression vectors are defined herein as DNA sequences that are 

required for the transcription of cloned DNA and the translation of their 
mRNAs in an appropriate host Such vectors can be used to express 
eukaryotic DNA in a variety of hosts such as bacteria, blue green algae, 
plant cells, insect cells and animal cells. Specifically designed vectors 

35 allow the shuttling of DNA between hosts such as bacteria-yeast or 

bacteria-animal cells. An appropriately constructed expression vector 
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should contain: an origin of replication for autonomous replication in 
host cells, selectable markers, a limited number of useful restriction 
enzyme sites, a potential for high copy number, and active promoters, A 
promoter is defined as a DNA sequence that directs RNA polymerase to 
5 bind to DNA and initiate RNA synthesis. A strong promoter is one 
which causes mRNAs to be initiated at high frequency. Expression 
vectors may include, but are not limited to, cloning vectors, modified 
cloning vectors, specifically designed plasmids or viruses. 

Commercially available mammalian expression vectors 

10 which may be suitable for recombinant human nNR7 and/or nNR7-l 
expression, include but are not limited to pcDNA3.1 (Invitrogen), 
pLITMUS28, pLITMUS29, pLITMUS38 and pLITMUS39 (New England 
Bioloabs), pcDNAI, pcDNAIamp (Invitrogen), pcDNA3 (Invitrogen), 
pMClneo (Stratagene), pXTl (Stratagene), pSG5 (Stratagene), EBO- 

15 pSV2-neo (ATCC 37593) pBPV-l(8-2) (ATCC 37110), pdBPV-MMTneo(342- 
12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), 
pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), and 1ZD35 (ATCC 
37565). 

A variety of bacterial expression vectors may be used to 

20 express recombinant human nNR7 and/or nNR7-l in bacterial cells. 
Commercially available bacterial expression vectors which may be 
suitable for recombinant human nNR7 and/or nNR7-l expression 
include, but are not limited to pCRII (Invitrogen), pCR2.1 (Invitrogen), 
pQE (Qiagen), pETlla (Novagen), lambda gtll (Invitrogen), and pKK223- 

25 3 (Pharmacia). 

A variety of fungal cell expression vectors may be used to 
express recombinant human nNR7 and/or nNR7-l in fungal cells. 
Commercially available fungal cell expression vectors which may be 
suitable for recombinant human nNR7 and/or nNR7-l expression 

30 include but are not limited to pYES2 (Invitrogen) and Pichia expression 
vector (Invitrogen). 

A variety of insect cell expression vectors may be used to 
express recombinant receptor in insect cells. Commercially available 
insect cell expression vectors which may be suitable for recombinant 

35 expression of human nNR7 and/or nNR7-l include but are not limited to 
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pBlueBacIII and pBlueBacHis2 (Invitrogen), and pAcG2T 
(Pharmingen). 

An expression vector containing DNA encoding a human 
nNR7 and/or nNR7-l-like protein may be used for expression of human 
5 nNR7 and/or nNR7-l in a recombinant host cell. Recombinant host cells 
may be prokaryotic or eukaryotic, including but not limited to bacteria 
such as E. coli, fungal cells such as yeast, mammalian cells including 
but not limited to cell lines of human, bovine, porcine, monkey and 
rodent origin, and insect cells including but not limited to Drosophila- 

10 and silkworm-derived cell lines. Cell lines derived from mammalian 
species which may be suitable and which are commercially available, 
include but are not limited to, L cells L-M(TK') (ATCC CCL 1.3), L cells 
L-M (ATCC CCL 1.2), Saos-2 (ATCC HTB-85), 293 (ATCC CRL 1573), Raji 
(ATCC CCL 86), CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 1650), COS-7 

15 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), 

NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 
1616), BS-C-1 (ATCC CCL 26), MRC-5 (ATCC CCL 171) and CPAE 
(ATCC CCL 209). 

The expression vector may be introduced into host cells via 

20 any one of a number of techniques including but not limited to 

transformation, transfection, protoplast fusion, and electroporation. 
The expression vector-containing cells are individually analyzed to 
determine whether they produce human nNR7 and/or nNR7- 1 protein. 
Identification of human nNR7 and/or nNR7-l expressing cells may be 

25 done by several means, including but not limited to immunological 
reactivity with anti-human nNR7 and/or nNR7-l antibodies, labeled 
ligand binding and the presence of host cell-associated human nNR7 
and/or nNR7-l activity. 

The cloned human nNR7 and/or nNR7-l cDNA obtained 

30 through the methods described above may be recombinantly expressed 
by molecular cloning into an expression vector (such as pcDNA3.1, pQE, 
pBlueBacHis2 and pLITMUS28) containing a suitable promoter and 
other appropriate transcription regulatory elements, and transferred 
into prokaryotic or eukaryotic host cells to produce recombinant human 

35 nNR7 and/or nNR7-l. Techniques for such manipulations can be found 
described in Sambrook, et aL, supra , are discussed at length in the 
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Example section and are well known and easily available to the artisan 
of ordinary skill in the art. 

Expression of human nNR7 and/or nNR7-l DNA may also 
be performed using in vitro produced synthetic mRNA. Synthetic 
5 mRNA can be efficiently translated in various cell-free systems, 
including but not limited to wheat germ extracts and reticulocyte 
extracts, as well as efficiently translated in cell based systems, 
including but not limited to microinjection into frog oocytes, with 
microinjection into frog oocytes being preferred. 

10 To determine the human nNR7 and/or nNR7-l cDNA 

sequence(s) that yields optimal levels of human nNR7 and/or nNR7-l, 
cDNA molecules including but not limited to the following can be 
constructed: a cDNA fragment containing the full-length open reading 
frame for human nNR7 and/or nNR7-l as well as various constructs 

15 containing portions of the cDNA encoding only specific domains of the 
protein or rearranged domains of the protein. All constructs can be 
designed to contain none, all or portions of the 5# and/or 3<J untranslated 
region of a human nNR7 and/or nNR7-l cDNA. The expression levels 
and activity of human nNR7 and/or nNK7-l can be determined following 

20 the introduction, both singly and in combination, of these constructs into 
appropriate host cells. Following determination of the human nNR7 
and/or nNR7-l cDNA cassette yielding optimal expression in transient 
assays, this nNR7 and/or nNR7-l cDNA construct is transferred to a 
variety of expression vectors (including recombinant viruses), including 

25 but not limited to those for mammalian cells, plant cells, insect cells, 
oocytes, bacteria, and yeast cells. 

The present invention also relates to polyclonal and 
monoclonal antibodies raised in response to either the human form of 
nNR7 and/or nNR7-l disclosed herein, or a biologically functional 

30 derivative thereof. It will be especially preferable to raise antibodies 
against epitopes within the NH2-terminal domain of nNK7 and/or 

nNR7-l, which show the least homology to other known proteins 
belonging to the human nuclear receptor superfamily. 

Recombinant nNR7 and/or nNR7-l protein can be separated 
35 from other cellular proteins by use of an lmmuno affinity column made 
with monoclonal or polyclonal antibodies specific for full-length nNR7 
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and/or nNR7-l protein, or polypeptide fragments of nNR7 and/or nNR7-l 
protein. Additionally, polyclonal or monoclonal antibodies may be 
raised against a synthetic peptide (usually from about 9 to about 25 
amino acids in length) from a portion of the protein as disclosed in SEQ 
5 ID NO: 2 or SEQ ID NO: 18. Monospecific antibodies to human nNR7 
and/or nNR7-l are purified from mammalian antisera containing 
antibodies reactive against human nNR7 and/or nNR7-l or are prepared 
as monoclonal antibodies reactive with human nNR7 and/or nNR7-l 
using the technique of Kohler and Milstein (1975, Nature 256: 495-497). 

10 Monospecific antibody as used herein is defined as a single antibody 
species or multiple antibody species with homogenous binding 
characteristics for human nNH7 and/or nNR7-l. Homogenous binding 
as used herein refers to the ability of the antibody species to bind to a 
specific antigen or epitope, such as those associated with human nNR7 

15 and/or nNR7-l, as described above. Human nNR7 and/or nNR7-l- 
specific antibodies are raised by immunizing animals such as mice, 
rats, guinea pigs, rabbits, goats, horses and the like, with an 
appropriate concentration of human nNR7 and/or nNR7-l protein or a 
synthetic peptide generated from a portion of human nNR7 and/or 

20 nNR7-l with or without an immune adjuvant. 

Preimmune serum is collected prior to the first 
immunization. Each animal receives between about 0.1 mg and about 
1000 mg of human nNR7 and/or nNR7-l protein associated with an 
acceptable immune adjuvant. Such acceptable adjuvants include, but 

25 are not limited to, Freund's complete, Freund's incomplete, alum- 
precipitate, water in oil emulsion containing Corynebacterium parvum 
and tRNA. The initial immunization consists of human nNR7 and/or 
nNR7-l protein or peptide fragment thereof in, preferably, Freund's 
complete adjuvant at multiple sites either subcutaneously (SO, 

30 intraperitoneally (IP) or both. Each animal is bled at regular intervals, 
preferably weekly, to detennine antibody titer. The animals may or may 
not receive booster injections following the initial immunization. Those 
animals receiving booster injections are generally given an equal 
amount of human nNR7 and/or nNR7-l in Freund's incomplete 

35 adjuvant by the same route. Booster injections are given at about three 
week intervals until maximal titers are obtained. At about 7 days after 
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each booster immunization or about weekly after a single 
immunization, the animals are bled, the serum collected, and aliquots 
are stored at about -20°C. 

Monoclonal antibodies (mAb) reactive with human nNR7 
5 and/or nNR7-l are prepared by immunizing inbred mice, preferably 
Balb/c, with human nNR7 and/or nNR7-l protein. The mice are 
immunized by the IP or SC route with about 1 mg to about 100 mg, 
preferably about 10 mg, of human nNR7 and/or nNR7-l protein in about 
0.5 ml buffer or saline incorporated in an equal volume of an acceptable 

10 adjuvant, as discussed above. Freund's complete adjuvant is preferred. 
The mice receive an initial immunization on day 0 and are rested for 
about 3 to about 30 weeks. Immunized mice are given one or more 
booster immunizations of about 1 to about 100 mg of human nNR7 
and/or nNR7-l in a buffer solution such as phosphate buffered saline by 

15 the intravenous (IV) route. Lymphocytes, from antibody positive mice, 
preferably splenic lymphocytes, are obtained by removing spleens from 
immunized mice by standard procedures known in the art. Hybridoma 
cells are produced by mixing the splenic lymphocytes with an 
appropriate fusion partner, preferably myeloma cells, under conditions 

20 which will allow the formation of stable hybridomas. Fusion partners 
may include, but are not limited to: mouse myelomas P3/NSl/Ag 4-1, 
MPC-11, S-194 and Sp 2/0, with Sp 2/0 being preferred. The antibody 
producing cells and myeloma cells are fused in polyethylene glycol, 
about 1000 mol. wt., at concentrations from about 30% to about 50%. 

25 Fused hybridoma cells are selected by growth in hypoxanthine, 

thymidine and aminopterin supplemented Dulbecco's Modified Eagles 
Medium (DMEM) by procedures known in the art. Supernatant fluids 
are collected form growth positive wells on about days 14, 18, and 21 and 
are screened for antibody production by an immunoassay such as solid 

30 phase immunoradioassay (SPIRA) using human nNR7 and/or nNR7-l 
as the antigen. The culture fluids are also tested in the Ouchterlony 
precipitation assay to determine the isotype of the mAb. Hybridoma 
cells from antibody positive wells are cloned by a technique such as the 
soft agar technique of MacPherson, 1973, Soft Agar Techniques, in 

35 Tissue Culture Methods and Applications, Kruse and Paterson, Eds., 
Academic Press. 
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Monoclonal antibodies are produced in vivo by injection of 
pristine primed Balb/c mice, approximately 0.5 ml per mouse, with 
about 2 x 106 to about 6 x 106 hybridoma cells about 4 days after priming. 
Ascites fluid is collected at approximately 8-12 days after cell transfer 
5 and the monoclonal antibodies are purified by techniques known in the 
art. 

In vitro production of anti-human nNR7 and/or nNR7-l 
mAb is carried out by growing the hybridoma in DMEM containing 
about 2% fetal calf serum to obtain sufficient quantities of the specific 

10 mAb. The mAb are purified by techniques known in the art. 

Antibody titers of ascites or hybridoma culture fluids are 
determined by various serological or immunological assays which 
include, but are not limited to, precipitation, passive agglutination, 
enzyme-linked immunosorbent antibody (ELISA) technique and 

15 radioimmunoassay (RIA) techniques. Similar assays are used to detect 
the presence of human nNR7 and/or nNR7-l in body fluids or tissue and 
cell extracts. 

It is readily apparent to those skilled in the art that the 
above described methods for producing monospecific antibodies may be 

20 utilized to produce antibodies specific for human nNR7 and/or nNR7-l 
peptide fragments, or full-length human nNR7 and/or nNR7-l. 

Human nNR7 and/or nNR7-l antibody affinity columns are 
made, for example, by adding the antibodies to Affigel-10 (Biorad), a gel 
support which is pre-activated with N-hydroxysuccinimide esters such 

25 that the antibodies form covalent linkages with the agarose gel bead 

support. The antibodies are then coupled to the gel via amide bonds with 
the spacer arm. The remaining activated esters are then quenched with 
1M ethanolamine HC1 (pH 8.0). The column is washed with water 
followed by 0.23 M glycine HC1 CpH 2.6) to remove any non-conjugated 

30 antibody or extraneous protein. The column is then equilibrated in 

phosphate buffered saline CpH 7.3) and the cell culture supernatants or 
cell extracts containing full-length human nNR7 and/or nNR7-l or 
human nNR7 and/or nNR7-l protein fragments are slowly passed 
through the column. The column is then washed with phosphate 

35 buffered saline until the optical density (A280) falls to background, then 
the protein is eluted with 0.23 M glycine-HCl (pH 2.6). The purified 
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human nNK7 and/or nNR7-l protein is then dialyzed against phosphate 
buffered saline. 

Levels of human nNR7 and/or nNR7-l in host cells is 
quantified by a variety of techniques including, but not limited to, 

5 immuno affinity and/or ligand affinity techniques. nNK7 and/or nNR7- 
1-specific affinity beads or nNR7 and/or nNR7-l-specific antibodies are 
used to isolate 35S-methionine labeled or unlabelled nNR7 and/or 
nNR7-l. Labeled nNR7 and/or nNR7-l protein is analyzed by 
SDS-PAGE. Unlabelled nNR7 and/or nNR7-l protein is detected by 

10 Western blotting, ELISA or RIA assays employing either nNR7 and/or 
nNR7-l protein specific antibodies and/or antiphosphotyrosine 
antibodies. 

Following expression of nNR7 and/or nNR7-l in a host cell, 
nNR7 and/or nNR7-l protein may be recovered to provide nNR7 and/or 

15 nNR7-l protein in active form. Several nNR7 and/or nNR7-l protein 

purification procedures are available and suitable for use. Recombinant 
nNR7 and/or nNR7-l protein may be purified from cell ly sates and 
extracts, or from conditioned culture medium, by various combinations 
of, or individual application of salt fractionation, ion exchange 

20 chromatography, size exclusion chromatography, hydroxylapatite 
adsorption chromatography and hydrophobic interaction 
chromatography. 

Therapeutic or diagnostic compositions of the invention are 
administered to an individual in amounts sufficient to treat or diagnose 

25 disorders. The effective amount may vary according to a variety of 

factors such as the individual's condition, weight, sex and age. Other 
factors include the mode of administration. 

The pharmaceutical compositions may be provided to the 
individual by a variety of routes such as subcutaneous, topical, oral and 

30 intramuscular. 

The term "chemical derivative" describes a molecule that 
contains additional chemical moieties which are not normally a part of 
the base molecule. Such moieties may improve the solubility, half-life, 
absorption, etc. of the base molecule. Alternatively the moieties may 

35 attenuate undesirable side effects of the base molecule or decrease the 
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toxicity of the base molecule. Examples of such moieties are described 
in a variety of texts, such as Remington's Pharmaceutical Sciences. 

Compounds identified according to the methods disclosed 
herein may be used alone at appropriate dosages. Alternatively, 
5 co-administration or sequential administration of other agents may be 
desirable. 

The present invention also has the objective of providing 
suitable topical, oral, systemic and parenteral pharmaceutical 
formulations for use in the novel methods of treatment of the present 

10 invention. The compositions containing compounds identified 

according to this invention as the active ingredient can be administered 
in a wide variety of therapeutic dosage forms in conventional vehicles 
for administration. For example, the compounds can be administered 
in such oral dosage forms as tablets, capsules (each including timed 

15 release and sustained release formulations), pills, powders, granules, 
elixirs, tinctures, solutions, suspensions, syrups and emulsions, or by 
injection. Likewise, they may also be administered in intravenous (both 
bolus and infusion), intraperitoneal, subcutaneous, topical with or 
without occlusion, or intramuscular form, all using forms well known 

20 to those of ordinary skill in the pharmaceutical arts. 

Advantageously, compounds of the present invention may 
be administered in a single daily dose, or the total daily dosage may be 
administered in divided doses of two, three or four times daily. 
Furthermore, compounds for the present invention can be administered 

25 in intranasal form via topical use of suitable intranasal vehicles, or via 
transdermal routes, using those forms of transdermal skin patches well 
known to those of ordinary skill in that art. To be administered in the 
form of a transdermal delivery system, the dosage administration will, 
of course, be continuous rather than intermittent throughout the dosage 

30 regimen. 

For combination treatment with more than one active 

agent, where the active agents are in separate dosage formulations, the 

active agents can be administered concurrently, or they each can be 

administered at separately staggered times. 

35 The dosage regimen utilizing the compounds of the present 

invention is selected in accordance with a variety of factors including 
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type, species, age, weight, sex and medical condition of the patient; the 
severity of the condition to be treated; the route of administration; the 
renal, hepatic and cardiovascular function of the patient; and the 
particular compound thereof employed. A physician or veterinarian of 

5 ordinary skill can readily determine and prescribe the effective amount 
of the drug required to prevent, counter or arrest the progress of the 
condition. Optimal precision in achieving concentrations of drug within 
the range that yields efficacy without toxicity requires a regimen based 
on the kinetics of the drug's availability to target sites. This involves a 

10 consideration of the distribution, equilibrium, and elimination of a 
drug. 

The following example is provided to illustrate the present 
invention without, however, limiting the same hereto. 

15 EXAMPLE 1: 

Isolation and Characterization of a DNA Molecule 
Encoding nNR7 

DNA fragments encoding DBD regions of androgen receptor 
20 (AR; Lubahn, et al., 1988, MoL Endocrinol. 2 (12): 1265-1275), estrogen 
receptor b (ERb; Mosselman, et al., 1996, FEES Lett. 392 (1): 49-53), 
glucocorticoid receptor (GR; Hollenberg, et al., 1985, Nature 318: 635-641) 
and vitamin D receptor (VDR; Baker, et al., 1988, Proc. Natl. Acad. Sci. 
(U.S.A) 85 (10): 3294-3298) were generated by PCR made from human 
25 fetal brain mRNA for AR, ERb and Grand human small intestine 
mRNA for VDR. These cDNA fragments were subcloned into pCR 
cloning vectors as described by the manufacturer. The following 
oligonucleotide primers were utilized to generate fragments the above- 
mentioned cDNA fragments for plasmid sub cloning: 
30 1. GR-R 5'-TTTCGAGCTTCCAGGTTCAT-3' (SEQ ID NO: 9) 

2. GR-F 5-CTCCCAAACTCTGCCTGGTG-3' (SEQ ID NO: 10) 

3. ERB-R 5-CGGGAGCCACACTTCACCAT-3' (SEQ ID NO: 11) 

4. ERB-F 5'-GCTCACTTCTGCGCTGTCTG-3' (SEQ ID NO: 12) 

5. AR-R 5'-TTCCGGGCTCCCAGAGTCAT-3' (SEQ ID NO: 13) 
35 6. AR-F 5'-CAGAAGACCTGCCTGATCTG-3' (SEQ ID NO: 14) 

7. VDR-R S'-GAAATGAACTCCTTCATCAT-S' (SEQ ID NO: 15) 

8. VDR-F 5-CCGGATCTGTGGGGTGTGTG-3'(SEQ ID NO: 16). 
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The DNA fragments were purified using a Qiagen gel extraction kit. 
Phosphorylation, self-ligation and transformation of the purified DNA 
was carried out as recommended by the manufacturer. 

These DBD probes were utilized to screen a mixed cDNA library. 
5 from adrenal, bone marrow, brain, fetal brain, heart, fetal kidney, 

liver, fetal liver, lung, fetal lung, mammary gland, pancreas, placenta, 
prostate, skeletal muscle, small intestine, spleen, testis, thyroid, 
thymus, and uterus tissue. The probes used in the screening were DNA 
fragments corresponding to AR, ERb, GR and VDR DNA binding 

10 domain regions and 250,000 primary clones from the mixed library were 
screened. After two rounds of screening, in vivo excision was carried 
out on the purified 1ZAPII phages which showed positive signals at low 
stringency condition. Plasmid DNA samples were submitted to 
sequence directly using M13 forward and reverse primers. Blast search 

15 of the sequence information obtained was done using computer program 
PhredPhrap. Clone gm6 was found to be a novel receptor. DNA 
sequencing was continued from both 5' and 3' ends on nNR7. An intron 
was identified when sequencing with pgm6y2 

(5*-CTTCAATGTCATGACATG-3'; SEQ ID NO; 3). A cDNA fragment 
20 flanking the intron region was retrieved via PCR using primer pair of 
NR7F (5'-CCAAATCTGCCGTGTATGTG-3'; SEQ ID NO; 4) and 
pgm6xC (S'-GTCAGTGCACTCTCCACGT-S'; SEQ ID NO; 5) followed by 
double nested primers of pgm6y2 and pgm6xD 

(S'-TGCAGCTGGTCCACCACGCG-S'; SEQ ID NO; 6). A prominent 
25 DNA fragment of -1.5 kb was amplified from testis and brain cDNAs. 
The fragment was purified and subcloned into pCRII vector (Invitrogen, 
San Diego, CA). Automated sequencing was performed on several of the 
clones. Sequence assembly and analysis were performed with 
SEQUENCHERTM 3.0 (Gene Codes Corporation, Ann Arbor, MI). 
30 Ambiguities and/or discrepancies between automated base calling in 
sequencing reads were visually examined and edited to the correct base 
call. The final nucleotide sequence encoding nNR7 is shown as set forth 
in Figure 1A-C and as set forth as SEQ ID NO: 1. The complete 
sequencing of the gm6 clone revealed a 3.1 kb cDNA which codes an 
35 authentic novel nuclear receptor. Primer pair pgm6xAR 

(5'-GGGTATGCTCTGTGACAAG-3'; SEQ ID NO; 7) and pgm6x 
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(S'-AGGCAGGCACTTTCATACC-S'; SEQ ID NO; 8) in the 3' non-coding 
region were used to scan the 83 clones of the Stanford radiation hybrid 
panel (Cox et al., 1990, Science, 250:245-250). The PCR results were 
scored and submitted to the Stanford Genome Center for linkage 

5 analysis. The result indicated that nNR7 was located on chromosome 3. 
Northern analysis on Clontech blots showed that nNR7 was mainly 
expressed in human liver at medium to low levels. The liver is the 
principal site to convert inert forms of D vitamin to active forms which 
are transported to target tissues, such as intestine, to activate VDR. 

10 nNR7 expressed in the liver may be activated by one of the vitamin D 
metabolites and perform other biological functions of D vitamins. In 
addition, as discussed herein, nNR7 is involved in regulation a CYP3A4. 



15 EXAMPLE 2: 

Isolation and Characterization of a DNA Molecule 
Encoding nNE7-l 

As disclosed within the specification, the human nNR7 cDNA 

20 does not contain a starting Met. Therefore, efforts were made to extend 
the nNR7 5'-end sequence. Three of the clones from 5'-end RACE (rapid 
amplification of cDNA end) showed divergence from the original one. 
One clone contains a starting Met with an inframe stop codon in front. 
This cDNA clone, nNR7-l, represents a complete open reading frame 

25 (ORF) for a human NR7-1 receptor protein. 

Oligonucleotide 5'-AAG CCC TTG CAT CCT TCA CA-3' (SEQ ID 
NO:19) was paired with primer API 5-CCA TCC TAA TAC GAC TCA 
CTA TAG GGC-3* SEQ ID NO:20; Clontech, Palo Alto, CA) to amplify 
nNR7 5' cDNA ends. The template used was human liver Marathon- 

30 Ready™ cDNA from Clontech. DNA product from one round of PCR 
reaction was gel purified using Qiagen gel extraction kit (Chatsworth, 
CA). The DNA fragment was then subcloned into PCRII™ TA vector 
(Invitrogen, Carlsbad, CA). Plasmid DNA from multiple clones were 
prepared and submitted for sequencing using the ABI PRISM^M dye 

35 terminator cycle sequencing ready reaction kit with AmpliTaq DNA 

polymerase, FS (Perkin Elmer, Norwalk, CT). The sequencing primers 
used were: TAF 5-GTA CCG AGC TCG GAT CCA CTA-3' (SEQ ID 
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NO: 21, TAR 5'-CCG CCA GTG TGA TGG ATA TCT-3' (SEQ ID NO:22) 
R7LA 5-CTC ATC TGC GTT GAC ACT GGG-3* (SEQ ID NO:23). 
Sequence assembly and analysis were performed with 
SEQUENCHERTM 3.0 (Gene Codes Corporation, Ann Arbor, MI). 
5 Ambiguities and/or discrepancies between automated base calling in 
sequencing reads were visually examined and edited to the correct base 
call. Clone 6.2 (refered herein as clone nNR7-l) results in a different 
open reading frame from the original one but with an in-frame Met and 
upstream stop codon. The site of divergence between SEQ ID NO:l and 

10 SEQ ID NO:17 (as discused in the Detailed Description of the Invention) 
is the same as for clone Fl and A2, the other two clones derived from 
this 5-RACE experiment. Therefore, it is likely that the cDNA clone 
which encodes nNR7 (SEQ ID NOs: 1 and 2) is a partially processed 
cDNA while clone 6.2 (nNR7-l; SEQ ID NOs: 1 and 2) is a completely 

15 processed cDNA. As noted in Example 1 for nNR7-l, nNR7-l is located 
on chromosome 3, near the 3ql3.2 locus. The nNR7-l clone is expressed 
in human liver at medium to low levels and within the small intestine. 
As noted throughout this specification, nNR7-l will be useful in assays 
to identify compounds which modulate CYP3A4 expression as well as 

20 possible interactions within Vitamin D metobolism. 



-42- 



WO 99/31129 



PCT/US98/26364 



WHAT IS CLAIMED: 

1. A purified DNA molecule encoding a human nNR7 
protein wherein said protein comprises the amino acid sequence as 

5 follows: 

SILCTGLFKV DPRGEVGAKN LPPSSPRGPE ANLEVRPKES WNHADFVHCE 
DTESVPGKPS VNADEEVGGP QICRVCGDKA TGYHFNVMTC EGCKGFFRRA 
MKRNARLRCP FRKGACEITR KTRRQCQACR LRKCLESGMK KEMIMSDEAV 
EERRALIKRK KSERTGTQPL GVQG LTEEQR MMIRELMDAQ MKTFDTTFSH 

10 FKNFRLPGVL SSGCELPESL QAPSREEAAK WSQVRKDLCS LKVSLQLRGE 
DGSVWNYKPP ADSGGKEIFS LLPHMADMST YMFKGIISFA KVISYFRDLP 
IEDQISLLKG AAFELCQLRF NTVFNAETGT WECGRLSYCL EDTAGGFQQL 
LLEPMLKFHY MLKKLQLHEE EYVLMQAISL FSPDRPGVLQ HRWDQLQEQ 
FAITLKSYIE CNRPQPAHRF LFLKIMAMLT ELRSINAQHT QRLLRIQDIH 

15 PFATPLMQEL FGITGS, as set forth in three-letter 
abbreviation in SEQ ID NO: 2. 

2. An expression vector for expressing a human nNR7 
protein in a recombinant host cell wherein said expression vector 

20 comprises a DNA molecule of claim 1. 

3. A host cell which expresses a recombinant human 
nNR7 protein wherein said host cell contains the expression vector of 
claim 2. 

25 

4. A process for expressing a human nNR7 protein in a 
recombinant host cell, comprising: 

(a) transfecting the expression vector of claim 2 into 
30 a suitable host cell; and, 

(b) culturing the host cells of step (a) under 
conditions which allow expression of said the human nNR7 protein 
from said expression vector. 

35 
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5. A purified DNA molecule encoding a human nNE7 
protein wherein said protein consists of the amino acid sequence as 
follows: 

SILCTGLFKV DPRGEVGAKN LPPSSPRGPE ANLEVRPKES WNHADFVHCE 
5 DTESVPGKPS VNADEEVGGP QICRVCGDKA TGYHFNVMTC EGCKGFFRRA 
MKRNARLRCP FRKGACEITR KTRRQCQACR LRKCLESGMK KEMIMSDEAV 
EERRALIKRK KSERTGTQPL GVQGLTEEQR MMIRELMDAQ MKTFDTTFSH 
FKNFRLPGVL SSGCELPESL QAPSREEAAK WSQVRKDLCS LKVSLQLRGE 
DGSWNYKPP ADSGGKEIFS LLPHMADMST YMFKGIISFA KVISYFRDLP 
10 IEDQISLLKG AAFELCQLRF NTVFNAETGT WECGRLSYCL EDTAGGFQQL 
LLEPMLKFHY MLKKLQLHEE EYVLMQAISL FSPDRPGVLQ HRWDQLQEQ 
FAITLKSYIE CNRPQPAHRF LFLKIMAMLT ELRSINAQHT QRLLRIQDIH 
PFATPLMQEL FGITGS , as set forth in three-letter abbreviation in 
SEQ ID NO: 2. 

15 

6. An expression vector for expressing a human nNR7 
protein in a recombinant host cell wherein said expression vector 
comprises a DNA molecule of claim 5. 

20 7. A host cell which expresses a recombinant human 

nNR7 protein wherein said host cell contains the expression vector of 
claim 6. 

8. A process for expressing a human nNR7 protein in a 
25 recombinant host cell, comprising: 

(a) transfecting the expression vector of claim 6 into 
a suitable host cell; and, 

30 (b) culturing the host cells of step (a) under 

conditions which allow expression of said the human nNR7 protein 
from said expression vector. 

9. A purified DNA molecule encoding a human nNR7 
35 protein wherein said DNA molecule consists of the nucleotide sequence 

as set forth in SEQ ID NO: 1, as follows: 
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TACGCCAAGC TCGAAATTAA 
CCACCGCGGT GGCGGCCGCT 
AATTCGAATT CTCATAACCT 
TTTACTTCAG TGGGAATCTC 
5 TGAGAAAAGC AAGAGAATAA 
GCTCCTTGGT AAAGCTACTC 
AAGTGGACCC CAGGGGAGAA 
CCAAGAGGCC CAGAAGCAAA 
CCATGCTGAC TTTGTACACT 

10 CCAGTGTCAA CGCAGATGAG 
TGTGGGGACA AGGCCACTGG 
ATGCAAGGGC TTTTTCAGGA 
GCCCCTTCCG GAAGGGCGCC 
TGCCAGGCCT GCCGCCTGCG 

15 GATGATCATG TCCGACGAGG 
GGAAGAAAAG TGAACGGACA 
ACAGAGGAGC AGCGGATGAT 
AACCTTTGAC ACTACCTTCT 
TGCTTAGCAG TGGCTGCGAG 

20 GAAGAAGCTG CCAAGTGGAG 
GGTCTCTCTG CAGCTGCGGG 
CCCCAGCCGA CAGTGGCGGG 
GCTGACATGT CAACCTACAT 
CATCTCCTAC TTCAGGGACT 

25 AGGGGGCCGC TTTCGAGCTG 
GCGGAGACTG GAACCTGGGA 
CACTGCAGGT GGCTTCCAGC 
ACTACATGCT GAAGAAGCTG 
CAGGCCATCT CCCTCTTCTC 

30 CGTGGTGGAC CAGCTGCAGG 
TTGAATGCAA TCGGCCCCAG 
ATGGCTATGC TCACCGAGCT 
GCTGCTGCGC ATCCAGGACA 
AGTTGTTCGG CATCACAGGT 

35 CGAGAGGCAG CCAGACCCAG 
ACAGATGGAC ACTGCCAAGA 



CCCTCACTAA AGGGAACAAA AGCTGGAGCT 
CTAGAACTAG TGGATCCCCC GGGCTGCAGG 
ATGACTAGGA CGGGAAGAGG AAGCACTGCC 
GGCCTCAGCC TGCAAGCCAA GTGTTCACAG 
GCTAATACTC CTGTCCTGAA CAAGGCAGCG 
CTTGATCGAT CCTTTGCACC GGATTGTTCA 
GTC GG AGCAA AGAACTTACC ACCAAGCAGT 
CCTGGAGGTG AGACCCAAAG AAAGCTGGAA 
GTGAGGACAC AGAGTCTGTT CCTGGAAAGC 
GAAGTCGGAG GTCCCCAAAT CTGCCGTGTA 
CTATCACTTC AATGTCATGA CATGTGAAGG 
GGGCCATGAA ACGCAACGCC CGGCTGAGGT 
TGCGAGATCA CCCGGAAGAC CCGGCGACAG 
CAAGTGCCTG GAGAGCGGCA TGAAGAAGGA 
CCGTGGAGGA GAGGCGGGCC TTGATCAAGC 
GGGACTCAGC CACTGGGAGT GCAGGGGCTG 
GATCAGGGAG CTGATGGACG CTCAGATGAA 
CCCATTTCAA GAATTTCCGG CTGCCAGGGG 
TTGCCAGAGT CTCTGCAGGC CCCATCGAGG 
CCAGGTC CGG AAAGATCTGT GCTCTTTGAA 
GGGAGGATGG CAGTGTCTGG AACTACAAAC 
AAAGAGATCT TCTCCCTGCT GCCCCACATG 
GTTCAAAGGC ATCATCAGCT TTGCCAAAGT 
TGCCCATCGA GGACCAGATC TCCCTGCTGA 
TGTCAACTGA GATTCAACAC AGTGTTCAAC 
GTGTGGCCGG CTGTCCTACT GCTTGGAAGA 
AACTTCTACT GGAGCCCATG CTGAAATTCC 
CAGCTGCATG AGGAGGAGTA TGTGCTGATG 
CCCAGACCGC CCAGGTGTGC TGCAGCACCG 
AGCAATTCGC CATTACTCTG AAGTCCTACA 
CCTGCTCATA GGTTCTTGTT CCTGAAGATC 
CCGCAGCATC AATGCTCAGC ACACCCAGCG 
TACACCCCTT TGCTACGCCC CTCATGCAGG 
AGCTGAGCGG CTGCCCTTGG GTGACACCTC 
AGCCCTCTGA GCCGCCACTC CCGGGCCAAG 
GCCGACAATG CCCTGCTGGC CTGTCTCCCT 
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AGGGAATTCC TGCTATGACA 
GTGCCCCCCA CCCCCAGTTC 
ACGTGGAGAG TGCACTGACC 
TGCCCTTTCC TTTTAAAAGG 
5 CCCACTAAAG TGTCAAGGTG 
ATCTGGGGTC TATGCCCACA 
TCATTGCTAC CTCTAATAGT 
CTCTTCCGAG CTGCTTTGTG 
ATGAGTATCT GTGGGAGTCC 

10 ACCAAATGTC AGAAGCTTGG 
GTCTCTGCAT CCATTTGAAC 
GCTGTGGGGT ATACAGCATT 
TTTATAGTTA AAAAAACAAA 
GAAATGATAA GTGACAAAAG 

15 GAGCTGTGAT GGC GGGCACT 
GAGTCTGTAG GAGCAAGGGC 
ATTTGGTGTA GGTAGGTCTG 
CTGGGGCTGG AATGCTGGGT 
AGTTAAACAC ACCGGAGAAG 

20 TACACATCTA TTCTCAAAGC 
TAGCCACTTG TGAGTAAAAA 
TATAAGGCAT TCCACACCTA 
TTTAATGTCA AATCAAGGCA 
AGGGGTAAAC TTTTTTGGCC 

25 GGAATTCGAA TTCGATATCA 
GGCCC GGTAC CCAATTCGCC 
(SEQ ID NO: 1) . 



GCTGGGTAGC ATTCCTCAGG AAGGACATGG 
AGTCTGTAGG GAGTGAAGCC ACAGATTCTT 
TGTAGGTCAG GACCATCAGA GAGGCAAGGT 
CCCTGTGGTC TGGGGAGAAA TCCCTCAGAT 
TGGAAGGGAC CAAGCGACCA AGGATAGGCC 
TACCCACGTT TGTTCGCTTC CTGAGTCTTT 
CCTGTCTCCC ACTTCCCACT CGTTCCCCTC 
GGCTCCAGGC CTGTACTCAT CGGCAGGTGC 
TCTAGAGAGA TGAGAAGCCA GGAGGCCTGC 
CATGACCTCA TTCCGGCCAC ATCATTCTGT 
ACATTATTAA GCACCGATAA TAGGTAGCCT 
GACTCAGATA TAGATCCTGA GCTCACAGAG 
CAGAAACACA AACAATTTGG ATCAAAAGGA 
CAGCACAAGG AATTTCCCTG TGTGGATGCT 
GGGTACCCAA GTGAAGGTTC CCGAGGACAT 
ACAAACTGCA GCTGTGAGTG CGTGTGTGTG 
TTTGCCACTT GATGGGGCCT GGG TTTGTTC 
ATGCTCTGTG ACAAGGCTAC GCTGACAATC 
AACCATTTAC ATGCACCTTA TATTTCTGTG 
TAAAGGGTAT GAAAGTGCCT GCCTTGTTTA 
TTTTTTTGCA TTTTCACAAA TTATACTTTA 
AGAACTAGTT TTGGGAAATG TAGCCCTGGG 
AAAGGAATTA AATAATGTAC TTTTGGCTAG 
TTTTTCTGGG GAAAATAATG TGGGGGTGTG 
AGCTTATCGA TACCGTCGAC CTCGAGGGGG 
CTATAGTGAG TCGTATTACA ATT 



10. A DNA molecule of claim 9 which consists of 
30 nucleotide 276 to about nucleotide 1676 of SEQ ID NO: 1. 



11. An expression vector for expressing a human nNR7 
protein wherein said expression vector comprises a DNA molecule of 
claim 9. 

35 
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12. An expression vector for expressing a human nNR7 
protein wherein said expression vector comprises a DNA molecule of 
claim 10. 

5 13. A host cell which expresses a recombinant human 

nNR7 protein wherein said host cell contains the expression vector of 
claim 11. 

14. A host cell which expresses a recombinant human 
10 nNH7 protein wherein said host cell contains the expression vector of 

claim 12. 

15. A process for expressing a human nNR7 protein in a 
recombinant host cell, comprising: 

15 

(a) transfecting the expression vector of claim 11 into 
a suitable host cell; and, 

(b) culturing the host cells of step (a) under 

20 conditions which allow expression of said the human nNR7 protein 
from said 

16. A purified human nNR7 protein which comprises 
the amino acid sequence as set forth in SEQ ID NO:2. 

25 

17. A purified human nNR7 protein of claim 16 which 
consists of the amino acid sequence as set forth in SEQ ID NO:2. 

18. A method for determining whether a substance is 
30 capable of binding to nNR7 comprising: 

(a) providing test cells by transfecting cells with an 
expression vector that directs the expression of nNR7 in the cells; 

(b) exposing the test cells to the substance; 

(c) measuring the amount of binding of the substance to 

35 nNR7; 
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(d) comparing the amount of binding of the substance to 
nNR7 in the test cells with the amount of binding of the substance to 
control cells that have not been transfected with nNR7. 



5 19. A method of deteraiining whether a substance acts 

as a modulator of nNR7 activity which comprises: 

(a) providing test cells by transfecting cells with a 
receptor expression vector that directs the expression of nNR7 in the 
cells; 

10 (b) providing test cells by transfecting the cells of step (a) 

with a second reporter expression vector that directs expression of a 
reporter gene under control of a regulatory element which is responsive 
tonNR7 

(b) exposing the test cells to the substance; 

15 (c) measuring the amount of binding of expression of the 

reporter gene; 

(d) comparing the amount of expression of the reporter 
gene in the test cells with the amount of expression of the reporter gene 
in control cells that has been transfected with a reporter vector of step (b) 
20 but not a receptor vector of step (a). 



20. A purified DNA molecule encoding a human nNR7 
protein wherein said protein comprises the amino acid sequence as 
follows: 

25 MTVTRTHHFK EGSLRAPAIP LHSAAAELAS NHPRGPEANL EVRPKESWNH 
ADFVHCEDTE SVPGKPSVNA DEEVGGPQIC RVCGDKATGY HFNVMTCEGC 
KGFFRRAMKR NARLRCPFRK GACEITRKTR RQCQACRLRK CLESGMKKEM 
IMSDEAVEER RALIKRKKSE RTGTQPLGVQ GLTEEQRMMI RELMDAQMKT 
FDTTFSHFKN FRLPGVLSSG CELPESLQAP SREEAAKWSQ VRKDLCSLKV 

30 SLQLRGEDGS VWNYKPPADS GGKEIFSLLP HMADMSTYMF KGIISFAKVI 
SYFRDLPIED QISLLKGAAF ELCQLRFNTV FNAETGTWEC GRLSYCLEDT 
AGGFQQLLLE PMLKFHYMLK KLQLHEEEYV LMQAISLFSP DRPGVLQHRV 
VDQLQEQFAI TLKSYIECNR PQPAHRFLFL KIMAMLTELR SINAQHTQRL 
LRIQDIHPFA TPLMQELFGI TGS, as set forth in the three-letter 

35 abbreviation in SEQ ID NO: 18. 
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21. An expression vector for expressing a human 
nNR7-l protein in a recombinant host cell wherein said expression 
vector comprises a DNA molecule of claim 20. 

5 22. A host cell which expresses a recombinant h uman 

nNR7-l protein wherein said host cell contains the expression vector of 
claim 21. 

23. A process for expressing a human nNR7-l protein in 
10 a recombinant host cell, comprising: 

(a) transfecting the expression vector of claim 21 into 
a suitable host cell; and, 

15 (b) culturing the host cells of step (a) under 

conditions which allow expression of said the human nNR7-l protein 
from said expression vector. 

24. A purified DNA molecule encoding a human nNR7-l 
20 protein wherein said protein consists of the amino acid sequence as 

follows: 

MTVTRTHHFK EGSLRAPAIP LHSAAAELAS NHPRGPEANL EVRPKESWNH 
ADFVHCEDTE SVPGKPSVNA DEEVGGPQIC RVCGDKATGY HFNVMTCEGC 
KGFFRRAMKR MARLRCPFRK GACEITRKTR RQCQACRLRK CLESGMKKEM 

25 IMSDEAVEER RALIKRKKSE RTGTQPLGVQ GLTEEQRMMI RELMDAQMKT 
FDTTFSfiFKN FRLPGVLSSG CELPESLQAP SREEAAKWSQ VRKDLCSLKV 
SLQLRGEDGS VWNYKPPADS GGKEIFSLLP HMADMSTYMF KGIISFAKVI 
SYFRDLPIED QISLLKGAAF ELCQLRFNTV FNAETGTWEC GRLSYCLEDT 
AGGFQQLLLE PMLKFHYMLK KLQLHEEEYV LMQAISLFSP DRPGVLQHRV 

30 VDQLQEQFAI TLKSYIECNR PQPAHRFLFL KIMAMLTELR SINAQHTQRL 
LRIQDIHPFA TPLMQELFGI TGS, as set forth in the three-letter 
abbreviation in SEQ ID NO: 18. 

25. An expression vector for expressing a human 
35 nNR7-lprotein in a recombinant host cell wherein said expression 

vector comprises a DNA molecule of claim 24. 
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26. A host cell which expresses a recombinant human 
nNR7-l protein wherein said host cell contains the expression vector of 
claim 25. 

5 

27. A process for expressing a human nNR7-l protein in 
a recombinant host cell, comprising: 

(a) transfecting the expression vector of claim 25 into 
10 a suitable host cell; and, 

(b) culturing the host cells of step (a) under 
conditions which allow expression of said the human nNR7-l protein 
from said expression vector. 

15 

28. A purified DNA molecule encoding a human nNR7-l 
protein wherein said DNA molecule consists of the nucleotide sequence 
as set forth in SEQ ID NO: 17, as follows: 

TCCATCCTAA TACGACTCAC TATAGGGCTC GAGCGGCCGC CCGGGCAGGT 

20 CTTTTGGCCT GCTGGGTTAG TGCTGGCAGC CCCCTGAGGC CAAGGACAGC 
AGCATGACAG TCACCAGGAC TCACCACTTC AAGGAGGGGT CCCTCAGAGC 
ACCTGCCATA CCCCTGCACA GTGCTGCGGC TGAGTTGGCT TCAAACCATC 
CAAGAGGCCC AGAAGCAAAC CTGGAGGTGA GACCCAAAGA AAGCTGGAAC 
CATGCTGACT TTGTACACTG TGAGGACACA GAGTCTGTTC CTGGAAAGCC 

25 CAGTGTCAAC GCAGATGAGG AAGTCGGAGG TCCCCAAATC TGCCGTGTAT 
GTGGGGACAA GGCCACTGGC TATCACTTCA ATGTCATGAC ATGTGAAGGA 
TGCAAGGGCT TTTTCAGGAG GGCCATGAAA CGCAACGCCC GGCTGAGGTG 
CCCCTTCCGG AAGGGCGCCT GCGAGATCAC CCGGAAGACC CGGCGACAGT 
GCCAGGCCTG CCGCCTGCGC AAGTGCCTGG AGAGCGGCAT GAAGAAGGAG 

30 ATGATCATGT CCGACGAGGC CGTGGAGGAG AGGCGGGCCT TGATCAAGCG 
GAAGAAAAGT GAACGGACAG GGACTCAGCC ACTGGGAGTG CAGGGGCTGA 
CAGAGGAGCA GCGGATGATG ATCAGGGAGC TGATGGACGC TCAGATGAAA 
ACCTTTGACA CTACCTTCTC CCATTTCAAG AATTTCCGGC TGC CAGGGGT 
GCTTAGCAGT GGCTGCGAGT TGCCAGAGTC TCTGCAGGCC CCATCGAGGG 

35 AAGAAGCTGC CAAGTGGAGC CAGGTCCGGA AAGATCTGTG CTCTTTGAAG 
GTCTCTCTGC AGCTGCGGGG GGAGGATGGC AGTGTCTGGA ACTACAAACC 
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CCCAGCCGAC AGTGGCGGGA 
CTGACATGTC AACCTACATG 
ATCTCCTACT TCAGGGACTT 
GGGGGCCGCT TTCGAGCTGT 
5 CGGAGACTGG AACCTGGGAG 
ACTGCAGGTG GCTTCCAGCA 
CTACATGCTG AAGAAGCTGC 
AGGCCATCTC CCTCTTCTCC 
GTGGTGGACC AGCTGCAGGA 

10 TGAATGCAAT CGGCCCCAGC 
TGGCTATGCT CACCGAGCTC 
CTGCTGCGCA TCCAGGACAT 
GTTGTTCGGC ATCACAGGTA 
GAGAGGCAGC CAGACCCAGA 

15 CAGATGGACA CTGCCAAGAG 
GGGAATTCCT GCTATGACAG 
TGCCCCCCAC CCCCAGTTCA 
CGTGGAGAGT GCACTGACCT 
GCCCTTTCCT TTTAAAAGGC 

20 CCACTAAAGT GTCAAGGTGT 
TCTGGGGTCT ATGCCCACAT 
CATTGCTACC TCTAATAGTC 
TCTTCCGAGC TGCTTTGTGG 
TGAGTATCTG TGGGAGTCCT 

25 CCAAATGTCA GAAGCTTGGC 
TCTCTGCATC CATTTGAACA 
CTGTGGGGTA TACAGCATTG 
TTATAGTTAA AAAAACAAAC 
AAATGATAAG TGACAAAAGC 

30 AGCTGTGATG GCGGGCACTG 
AGTCTGTAGG AGCAAGGGCA 
TTTGGTGTAG GTAGGTCTGT 
TGGGGCTGGA ATGCTGGGTA 
GTTAAACACA CCGGAGAAGA 

35 ACACATCTAT TCTCAAAGCT 
AGCCACTTGT GAGTAAAAAT 



AAGAGATCTT CTCCCTGCTG CCCCACATGG 
TTCAAAGGCA TCATCAGCTT TGCCAAAGTC 
GCCCATCGAG GACCAGATCT CCCTGCTGAA 
GTCAACTGAG ATTCAACACA GTGTTCAACG 
TGTGGCCGGC TGTCCTACTG CTTGGAAGAC 
ACTTCTACTG GAGCCCATGC TGAAATTCCA 
AGCTGCATGA GGAGGAGTAT GTGCTGATGC 
CCAGACCGCC CAGGTGTGCT GCAGCACCGC 
GCAATTCGCC ATTACTCTGA AGTCCTACAT 
CTGCTCATAG GTTCTTGTTC CTGAAGATCA 
CGCAGCATCA ATGCTCAGCA CACCCAGCGG 
ACACCCCTTT GCTACGCCCC TCATGCAGGA 
GCTGAGCGGC TGCCCTTGGG TGACACCTCC 
GCCCTCTGAG CCGCCACTCC CGGGCCAAGA 
CCGACAATGC CCTGCTGGCC TGTCTCCCTA 
CTGGCTAGCA TTCCTCAGGA AGGACATGGG 
GTCTGTAGGG AGTGAAGCCA CAGATTCTTA 
GTAGGTCAGG ACCATCAGAG AGGCAAGGTT 
CCTGTGGTCT GGGGAGAAAT CCCTCAGATC 
GGAAGGGACC AAGCGACCAA GGATAGGCCA 
ACCCACGTTT GTTCGCTTCC TGAGTCTTTT 
CTGTCTCCCA CTTCCCACTC GTTCCCCTCC 
GCTCCAGGCC TGTACTCATC GGCAGGTGCA 
CTAGAGAGAT GAGAAGCCAG GAGGCCTGCA 
ATGACCTCAT TCCGGCCACA TCATTCTGTG 
CATTATTAAG CACCGATAAT AGGTAGCCTG 
ACTCAGATAT AGATCCTGAG CTCACAGAGT 
AGAAACACAA ACAATTTGGA TCAAAAGGAG 
AGCACAAGGA ATTTCCCTGT GTGGATGCTG 
GGTACCCAAG TGAAGGTTCC CGAGGACATG 
CAAACTGCAG CTGTGAGTGC GTGTGTGTGA 
TTGCCACTTG ATGGGGCCTG GGTTTGTTCC 
TGCTCTGTGA CAAGGCTACG CTGACAATCA 
ACCATTTACA TGCACCTTAT ATTTCTGTGT 
AAAGGGTATG AAAGTGCCTG CCTTGTTTAT 
TTTTTTGCAT TTTCACAAAT TATACTTTAT 
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ATAAGGCATT CCACACCTAA GAACTAGTTT TGGGAAATGT AGCCCTGGGT 
TTAATGTCAA ATCAAGGCAA AAGGAATTAA ATAATGTACT TTTGGCTAGA 
GGGGTAAACT TTTTTGGCCT TTTTCTGGGG AAAATAATGT GGGGGTGTGG 
(SEQ ID NO: 17) . 

5 

29. A DNA molecule of claim 28 which consists of 
nucleotide 104 to about nucleotide 1525 of SEQ ID NO: 17. 

30. An expression vector for expressing a human 
10 nNR7-l protein wherein said expression vector comprises a DNA 

molecule of claim 28. 

31. An expression vector for expressing a human 
nNR7-l protein wherein said expression vector comprises a DNA 

15 molecule of claim 29. 

32. A host cell which expresses a recombinant human 
nNR7-l protein wherein said host cell contains the expression vector of 
claim 30. 

20 

33. A host cell which expresses a recombinant human 
nNR7-l protein wherein said host cell contains the expression vector of 
claim 31. 

25 34. A process for expressing a human nNR7 protein in a 

recombinant host cell, comprising: 

(a) transfecting the expression vector of claim 29 into 
a suitable host cell; and, 



30 



(b) culturing the host cells of step (a) under 
conditions which allow expression of said the human nNR7-l protein 
from said expression vector. 
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35. A purified human nNR7-l protein which comprises 
the amino acid sequence as set forth in SEQ ED NO: 18. 

5 36. A purified human nNR7-l protein of claim 35 which 

consists of the amino acid sequence as set forth in SEQ ID NO: 18. 

37. A method for determining whether a substance is 
capable of binding to nNR7-l comprising: 

10 (a) providing test cells by transfecting cells with an 

expression vector that directs the expression of nNR7-l in the cells; 

(b) exposing the test cells to the substance; 

(c) measuring the amount of binding of the substance to 

nNR7-l; 

15 (d) comparing the amount of binding of the substance to 

nNR7-l in the test cells with the amount of binding of the substance to 
control cells that have not been transfected with nNR7-l. 

38. A method of determining whether a substance acts 
20 as a modulator of nNR7-l activity which comprises: 

(a) providing test cells by transfecting cells with a 
receptor expression vector that directs the expression of nNR7-l in the 
cells; 

(b) providing test cells by transfecting the cells of step (a) 
25 with a second reporter expression vector that directs expression of a 

reporter gene under control of a regulatory element which is responsive 
tonNR7-l 

(b) exposing the test cells to the substance; 

(c) measuring the amount of binding of expression of the 
30 reporter gene; 

(d) comparing the amount of expression of the reporter 
gene in the test cells with the amount of expression of the reporter gene 
in control cells that has been transfected with a reporter vector of step (b) 
but not a receptor vector of step (a). 

35 
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TACGCCAAGC 


TCGAAATTAA 


CCCTCACTAA AGGGAACAAA AGCTGGAGCT 


CCACCGCGGT 


GGCGGCCGCT 


CTAGAACTAG TGGATCCCCC GGGCTGCAGG 


AATTCGAATT 


CTCATAACCT 


ATGACTAGGA CGGGAAGAGG AAGCACTGCC 


TTTACTTCAG 


TGGGAATCTC 


GGCCTCAGCC TGCAAGCCAA GTGTTCACAG 


TGAGAAAAGC 


AAGAGAATAA 


GCTAATACTC CTGTCCTGAA CAAGGCAGCG 


GCTCCTTGGT 


AAAGCTACTC 


CTTGATCGAT CCTTTGCACC GGATTGTTCA 


AAGTGGACCC 


CAGGGGAGAA 


O ~T~ O O O a O A A * O A A O 1 f A O O A O O A A O O A "T~ 

GTCGGAGCAA AGAACTTACC ACCAAGCAGT 


CCAAGAGGCC 


CAGAAGCAAA 


CCTGGAGGTG AGACCCAAAG AAAGCTGGAA 


CCATGCTGAC 


TTTGTACACT 


GTGAGGACAC AGAGTCTGTT CCTGGAAAGC 


CCAGTGTCAA 


CGCAGATGAG 


j-\ a ft » t" O /"\ a /"i /~i -r* O O O O A a a T* O "T O O OO^^TO 5 A 

GAAGTCGGAG GTCCCCAAAT CTGCCGTGTA 


TGTGGGGACA 


AGGCCACTGG 


CTATCACTTC AATGTCATGA CATGTGAAGG 


ATGCAAGGGC 


TTTTTCAGGA 


GGGCCATGAA ACGCAACGCC CGGCTGAGGT 


GCCCCTTCCG 


GAAGGGCGCC 


TGCGAGATCA CCCGGAAGAC CCGGCGACAG 


TGCCAGGCCT 


GCCGCCTGCG 


CAAGTGCCTG GAGAGCGGCA TGAAGAAGGA 


GATGATCATG 


TCCGACGAGG 


CCGTGGAGGA GAGGCGGGCC TTGATCAAGC 


GGAAGAAAAG 


TGAACGGACA 


GGGACTCAGC CACTGGGAGT GCAGGGGCTG 


ACAGAGGAGC 


AGCGGATGAT 


GATCAGGGAG CTGATGGACG CTCAGATGAA 


AACCTTTGAC 


ACTACCTTCT 


CCCATTTCAA GAATTTCCGG CTGCCAGGGG 


TGCTTAGCAG 


TGGCTGCGAG 


TTGCCAGAGT CTCTGCAGGC CCCATCGAGG 


GAAGAAGCTG 


CCAAGTGGAG 


CCAGGTCCGG AMGATCTGT GCTCTTTGAA 


GGTCTCTCTG 


CAGCTGCGGG 


GGGAGGATGG CAGTGTCTGG AACTACAAAC 


CCCCAGCCGA 


CAGTGGCGGG 


AAAGAGATCT TCTCCCTGCT GCCCCACATG 


GCTGACATGT 


CAACCTACAT 


GTTCAMGGC ATCATCAGCT TTGCCAAAGT 


CATCTCCTAC 


TTCAGGGACT 


TGCCCATCGA GGACCAGATC TCCCTGCTGA 
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AGGGGGCCGC 


TTTCGAGCTG 


TGTCAACTGA GATTCAACAC 


AGTGTTCAAC 


GCGGAGACTG 


GAACCTGGGA 


GTGTGGCCGG CTGTCCTACT 


GCTTGGAAGA 


CACTGCAGGT 


GGCTTCCAGC 


AACTTCTACT GGAGCCCATG 


CTGAAATTCC 


ACTACATGCT 


GAAGAAGCTG 


CAGCTGCATG AGGAGGAGTA 


TGTGCTGATG 


CAGGCCATCT 


CCCTCTTCTC 


CCCAGACCGC CCAGGTGTGC 


TGCAGCACCG 


CGTGGTGGAC 


CAGCTGCAGG 


AGCAATTCGC CATTACTCTG 


AAGTCCTACA 


TTGAATGCAA 


TCGGCCCCAG 


CCTGCTCATA GGTTCTTGTT 


CCTGAAGATC 


ATGGCTATGC 


TCACCGAGCT 


CCGCAGCATC AATGCTCAGC 


ACACCCAGCG 


GCTGCTGCGC 


ATCCAGGACA 


TACACCCCTT TGCTACGCCC 


CTCATGCAGG 


AGTTGTTCGG 


CATCACAGGT 


AGCTGAGCGG CTGCCCTTGG 


GTGACACCTC 


CGAGAGGCAG 


CCAGACCCAG 


AGCCCTCTGA GCCGCCACTC 


CCGGGCCAAG 


ACAGATGGAC 


ACTGCCAAGA 


GCCGACAATG CCCTGCTGGC 


CTGTCTCCCT 


AGGGAATTCC 


TGCTATGACA 


GCTGGCTAGC ATTCCTCAGG 


AAGGACATGG 


GTGCCCCCCA 


CCCCCAGTTC 


AGTCTGTAGG GAGTGAAGCC 


ACAGATTCTT 


ACGTGGAGAG 


TGCACTGACC 


TGTAGGTCAG GACCATCAGA 


GAGGCAAGGT 


TGCCCTTTCC 


TTTTAAAAGG 


CCCTGTGGTC TGGGGAGAAA 


TCCCTCAGAT 


CCCACTAMG 


TGTCAAGGTG 


TGGAAGGGAC CAAGCGACCA 


AGGATAGGCC 


ATCTGGGGTC 


TATGCCCACA 


TACCCACGTT TGTTCGCTTC 


CTGAGTCTTT 


TCATTGCTAC 


CTCTAATAGT 


CCTGTCTCCC ACTTCCCACT 


CGTTCCCCTC 


CTCTTCCGAG 


CTGCTTTGTG 


GGCTCCAGGC CTGTACTCAT 


CGGCAGGTGC 


ATGAGTATCT 


GTGGGAGTCC 


TCTAGAGAGA TGAGAAGCCA 


GGAGGCCTGC 


ACCAAATGTC 


AGAAGCTTGG 


CATGACCTCA TTCCGGCCAC 


ATCATTCTGT 


GTCTCTGCAT 


CCATTTGAAC 


ACATTATTAA GCACCGATAA 


TAGGTAGCCT 
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\JL( 1 u 1 UUUu 1 


ATAfAGfATT 


GACTCAGATA 


TAGATCCTGA 


GCTCACAGAG 


TTTATAGTTA 


AAAAAACAAA 


CAGAAACACA 


AACAATTTGG 


ATCAAAAGGA 


GAAATGATAA 


GTGACAAAAG 


CAGCACAAGG 


AATTTCCCTG 


TGTGGATGCT 


GAGCTGTGAT 


GGCGGGCACT 


GGGTACCCAA 


GTGAAGGTTC 


CCGAGGACAT 


GAGTCTGTAG 


GAGCAAGGGC 


ACAAACTGCA 


GCTGTGAGTG 


CGTGTGTGTG 


ATTTGGTGTA 


GGTAGGTCTG 


TTTGCCACTT 


GATGGGGCCT 


GGGTTTGTTC 


CTGGGGCTGG 


AATGCTGGGT 


ATGCTCTGTG 


ACAAGGCTAC 


GCTGACAATC 


AGTTAAACAC 


ACCGGAGAAG 


AACCATTTAC 


ATGCACCTTA 


TATTTCTGTG 


TACACATCTA 


TTCTCAAAGC 


TAAAGGGTAT 


GAAAGTGCCT 


GCCTTGTTTA 


TAGCCACTTG 


TGAGTAAAAA 


TTTTTTTGCA 


TTTTCACAAA 


TTATACTTTA 


TATAAGGCAT 


TCCACACCTA 


AGAACTAGTT 


TTGGGAAATG 


TAGCCCTGGG 


TTTAATGTCA 


AATCAAGGCA 


AAAGGAATTA 


AATAATGTAC 


TTTTGGCTAG 


w 

AGGGGTAAAC 


TTTTTTGGCC 


TTTTTCTGGG 


GAAAATAATG 


TGGGGGTGTG 


GGMTTCGAA 


TTCGATATCA 


AGCTTATCGA 


TACCGTCGAC 


CTCGAGGGGG 


GGCCCGGTAC 


CCAATTCGCC 


CTATAGTGAG 


TCGTATTACA 


ATT (SEQ II 
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1 TACGCCAAGCTCGAAATTAACCCTCACTAAAGGuAACAAAAGCTGGAGCTCCACCGCGGT 60 

61 GGCGGCCGCTCTAGAACTAGTGGATCCCCCGGGCTGCAGGAATTCGAATTCTCATAACCT 120 

121 ATGACTAGGACGGGMGAGGAAGCACTGCCTTTACTTCAGTGGGAATCTCGGCCTCAGCC 180 

181 TGCAAGCCAAGTGTTCACAGTGAGAAMGCAAGAGAATAAGCTAATACTCCTGTCCTGAA 240 

241 CAAGGCAGCGGCTCCTTGGTAAAGCTACTCCTTGATCGATCCTTTGCACCGGATTGTTCA 300 

SILCTGLFK 

301 AAGTGGACCCCAGGGGAGAAGTCGGAGCAAAGAACTTACCACCAAGCAGTCCAAGAGGCC 360 
VDPRGEVGAKNLPPSSPRGP 

361 CAGAAGCAAACCTGGAGGTGAGACCCAAAGAAAGCTGGAACCATGCTGACTTTGTACACT 420 
EANLEVRPKESWNHADFVHC 

42 1 GTGAGGACACAGAGTCTGTTCCTGGAAAGCCCAGTGTCAACGCAGATGAGGAAGTCGGAG 480 
EDTESVPGKPSVNADEEVGG 

481 GTCCCCAAATCTGCCGTGTATGTGGGGACAAGGCCACTGGCTATCACTTCAATGTCATGA 540 
P Q I CRVCGDKATGYHFNVMT 

541 C ATGTGAAGGATGCAAGGGCTTTTTCAGGAGGGC CATGAAACGCAAC GCCCGGCTGAGGT 600 
CEGCKGFFRRAM KRNARLRC 

601 GCCCCTTCCGGAAGGGCGCCTGCGAGATCACCCGGMGACCCGGCGACAGTGCCAGGCCT 660 
PFRKGACEITRKTRRQCQAC 

661 GCCGCCTGCGCAAGTGCCTGGAGAGCGGCATGAAGAAGGAGATGATCATGTCCGACGAGG 720 
RLRKCLESGM KKE MIMSDEA 

721 CCGTGGAGGAGAGGCGGGCCTTGATCAAGCGGAAGAAAAGTGAACGGACAGGGACTCAGC 780 
VEERRAL IKRKKSERTGTQP 

781 CACTGGGAGTGCAGGGGCTGACAGAGGAGCAGCGGATGATGATCAGGGAGCTGATGGACG 840 
LGVQGLTEEQRMMIRELMDA 

841 CTCAGATGAAAACCTTTGACACTACCTTCTCCCATTTCAAGAATTTCCGGCTGCCAGGGG 900 
QMKTFDTTFSHFKNFRLPGV 

90 1 TGCTTAGCAGTGGCTGCGAGTTGCCAGAGTCTCTGCAGGCCCCATCGAGGGAAGAAGCTG 960 
LSSGCELPESLQAPSREEAA 
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961 CCAAGTGGAGCCAGGTCCGGAAAGATCTGTGCTCTTTGAAGGTCTCTCTGCAGCTGCGGG 1020 
KWSQVRKDLCSLKVSLQLRG 

1021 GGGAGGATGGCAGTGTCTGGMCTACAAACCCCCAGCCGACAGTGGCGGGAAAGAGATCT 1080 
EDGSVWNYKPPADSGGKE I F 

108 1 TCTCCCTGCTGCCCCACATGGCTGACATGTCAACCTACATGTTCAAAGGCATCATCAGCT 1140 
SLLPHMADMSTYMFKGI ISF 

1141 TTGCCAAAGTCATCTCCTACTTCAGGGACTTGCCCATCGAGGACCAGATCTCCCTGCTGA 1200 
AKVISYFRDLPIEDQISLLK 

1201 AGGGGGCCGCTTTCGAGCTGTGTCAACTGAGATTCAACACAGTGTTCAACGCGGAGACTG 1260 
GAAFELCQLRFNTVFNAETG 

1261 GMCCTGGGAGTGTGGCCGGCTGTCCTACTGCTTGGAAGACACTGCAGGTGGCTTCCAGC 1320 
TWECGRLSYCLEDTAGGFQQ 

1321 AACTTCTACTGGAGCCCATGCTGAAATTCCACTACATGCTGAAGAAGCTGCAGCTGCATG 1380 
LLLEPMLKFHYMLKKLQLHE 

1381 AGGAGGAGTATGTGCTGATGCAGGCCATCTCCCTCTTCTCCCCAGACCGCCCAGGTGTGC 1440 
EEYVLMQAISLF5PDRPGVL 

1441 TGCAGCACCGCGTGGTGGACCAGCTGCAGGAGCAATTCGCCATTACTCTGAAGTCCTACA 1500 
QHRVVDQLQEQFAITLKSYI 

1 50 1 TTGAATGCAATCGGCCCCAGCCTGCTCATAGGTTCTTGTTCCTGAAGATCATGGCTATGC 1 560 
ECNRPQPAHRFLFLKIMAML 

1561 TCACCGAGCTCCGCAGCATCAATGCTCAGCACACCCAGCGGCTGCTGCGCATCCAGGACA 1620 
TELRS INAQHTQRLLR IQD I 

1621 TACACCCCTTTGCTACGCCCCTCATGCAGGAGTTGTTCGGCATCACAGGTAGCTGAGCGG 1 680 
HPFATPLMQELFGITGS (SEQ ID NO:2) 

1681 CTGCCCTTGGGTGACACCTCCGAGAGGCAGCCAGACCCAGAGCCCTCTGAGCCGCCACTC 1740 

1741 CCGGGCCAAGACAGATGGACACTGCCAAGAGCCGACAATGCCCTGCTGGCCTGTCTCCCT 1800 
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1801 


AGGGAATTCCTGCTATGACAGCTGGCTAGCATTCCTCAGGAAG6ACATGGGTGCCCCCCA 


I860 


1861 


CCCCCAGTTCAGTCTGTAGGGAGTGAAGCCACAGATTCTTACGTGGAGAGTGCACTGACC 


1920 


1921 


TGTAGGTCAGGACCATCAGAGAGGCAAGGTTGCCCTTTCCTTTTAAAAGGCCCTGTGGTC 


1980 


1981 


TGGGGAGAAATCCCTCAGATCCCACTAAAGTGTCAAGGTGTGGAAGGGACCAAGCGACCA 


2040 


2041 


AGGATAGGCCATCTGGGGTCTATGCCCACATACCCACGTTTGTTCGCTTCCTGAGTCTTT 


2100 


2101 


TCATTGCTACCTCTAATAGTCCTGTCTCCCACTTCCCACTCGTTCCCCTCCTCTTCCGAG 


2160 


2161 


CTGCTTTGTGGGCTCCAGGCCTGTACTCATCGGCAGGTGCATGAGTATCTGTGGGAGTCC 


2220 


2221 


TCTAGAGAGATGAGAAGCCAGGAGGCCTGCACCAAATGTCAGAAGCTTGGCATGACCTCA 


2280 


2281 


TTCCGGCCACATCATTCTGTGTCTCTGCATCCATTTGAACACATTATTAAGCACCGATAA 


2340 


2341 


TAGGTAGCCTGCTGTGGGGTATACAGCATTGACTCAGATATAGATCCTGAGCTCACAGAG 


2400 


2401 


TTTATAGTTAAAAAAACAMCAGAAACACAAACAATTTGGATCAAAAGGAGAAATGATAA 


2460 


2461 


GTGACAAMGCAGCACAAGGAATTTCCCTGTGTGGATGCTGAGCTGTGATGGCGGGCACT 


2520 


2521 


GGGTACCCAAGTGAAGGTTCCCGAGGACATGAGTCTGTAGGAGCMGGGCACAAACTGCA 


2580 


2581 


GCTGTGAGTGCGTGTGTGTGATTTGGTGTAGGTAGGTCTGTTTGCCACTTGATGGGGCCT 


2640 


2641 


GGGTTTGTTCCTGGGGCTGGAATGCTGGGTATGCTCTGTGACAAGGCTACGCTGACAATC 


2700 


2701 


AGTTAAACACACCGGAGAAGAACCATTTACATGCACCTTATATTTCTGTGTACACATCTA 


2760 


2761 


TTCTCAAAGCTAAAGGGTATGAAAGTGCCTGCCTTGTTTATAGCCACTTGTGAGTAAAAA 


2820 


2821 


TTTTTTTGCATTTTCACAAATTATACTTTATATAAGGCATTCCACACCTAAGAACTAGTT 


2880 


2881 


TTGGGAAATGTAGCCCTGGGTTTAATGTCAAATCAAGGCAAAAGGAATTAAATAATGTAC 


2940 


2941 


TTTTGGCTAGAGGGGTAAACTTTTTTGGCCTTTTTCTGGGGAAAATAATGTGGGGGTGTG 


3000 


3001 


GGAATTCGAATTCGATATCAAGCTTATCGATACCGTCGACCTCGAGGGGGGGCCCGGTAC 


3060 


3061 


CCAATTCGCCCTATAGTGAGTCGTATTACAATT CSEQ ID NO : 1 ) 


3093 




FIG.2C 
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5ILCTGLFKV DPRGEVGAKN LPPSSPRGPE ANLEVRPKES WNHADFVHCE 
DTESVPGKPS VNADEEVGGP OI CRVCGDKA TGYH FNVMTC EGCKGFFRRA 
MKRNARI Rf.P FRKGACEITR KTRRQCQACR LRKCLESGM K KEMIMSDEAV 
EERRALIKRK KSERTGTQPL GVQGLTEEQR MMIRELMDAQ MKTFDTTFSH 
FKNFRLPGVL SSGCELPESL QAPSREEAAK WSQVRKDLCS LKVSLQLRGE 
DGSVWNYKPP ADSGGKEIFS LLPHMADMST YMFKGI ISFA KVISYFRDLP 
IEDQISLLKG AAFELCQLRF NTVFNAETGT WECGRLSYCL EDTAGGFQQL 
LLEPMLKFHY MLKKLQLHEE EYVLMQAISL FSPDRPGVLQ HRVVDQLQEQ 
FAITLKSYIE CNRPQPAHRF LFLKIMAMLT ELRSINAQHT QRLLRIQDIH 
PFATPLMQEL FGITG5 (SEQ ID N0:2) 

FIG. 3 
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1 TRRATRRTAA TACGACTCAC TATAGGGr.Tr RART.mrr.fir. CrGGGRAGGT 
51 CTTTTGGf.CT GCT GGGTTAG TGCTfi fiCAfiC GRRRTGAGfir CAAGGARAGG 
101 AGGATGARAG TRARRAGGAG TRARRACTTl AAGGAGGGGT CCCTCAfiAGr 
151 ARRTGGRATA CCCCTGCACA GTGCTGCGGt" TGAGTTGGRT TRAAACrA Tr. 



201 


CAAGAGGCCC 


AGAAGCAAAC 


CTGGAGGTGA 


GACCCAAAGA 


AAGCTGGAAC 


251 


CATGCTGACT 


TTGTACACTG 


TGAGGACACA 


GAGTCTGTTC 


CTGGAAAGCC 


301 


CAGTGTCAAC 


GCAGATGAGG 


AAGTCGGAGG 


TCCCCAAATC 


TGCCGTGTAT 


351 


GTGGGGACAA 


GGCCACTGGC 


TATCACTTCA 


ATGTCATGAC 


ATGTGAAGGA 


401 


TGCAAGGGCT 


TTTTCAGGAG 


GGCCATGAAA 


CGCAACGCCC 


GGCTGAGGTG 


451 


CCCCTTCCGG 


AAGGGCGCCT 


GCGAGATCAC 


CCGGAAGACC 


CGGCGACAGT 


501 


GCCAGGCCTG 


CCGCCTGCGC 


AAGTGCCTGG 


AGAGCGGCAT 


GAAGAAGGAG 


551 


ATGATCATGT 


CCGACGAGGC 


CGTGGAGGAG 


AGGCGGGCCT 


TGATCAAGCG 


601 


GAAGAAAAGT 


GAACGGACAG 


GGACTCAGCC 


ACTGGGAGTG 


CAGGGGCTGA 


651 


CAGAGGAGCA 


GCGGATGATG 


ATCAGGGAGC 


TGATGGACGC 


TCAGATGAAA 


701 


ACCTTTGACA 


CTACCTTCTC 


CCATTTCAAG 


AATTTCCGGC 


TGCCAGGGGT 


751 


GCTTAGCAGT 


GGCTGCGAGT 


TGCCAGAGTC 


TCTGCAGGCC 


CCATCGAGGG 


801 


AAGAAGCTGC 


CAAGTGGAGC 


CAGGTCCGGA 


AAGATCTGTG 


CTCTTTGAAG 


851 


GTCTCTCTGC 


AGCTGCGGGG 


GGAGGATGGC 


AGTGTCTGGA 


ACTACAAACC 


901 


CCCAGCCGAC 


AGTGGCGGGA 


AAGAGATCTT 


CTCCCTGCTG 


CCCCACATGG 


951 


CTGACATGTC 


AACCTACATG 


TTCAAAGGCA 


TCATCAGCTT 


TGCCAAAGTC 


1001 


ATCTCCTACT 


TCAGGGACTT 


GCCCATCGAG 


GACCAGATCT 


CCCTGCTGAA 


1051 


GGGGGCCGCT 


T7CGAGCTGT 


GTCAACTGAG 


ATTCAACACA 


GTGTTCAACG 



FIG.4A 
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1101 CGGAGACTGG AACCTGGGAG TGTGGCCGGC TGTCCTACTG CTTGGAAGAC 
1151 ACTGCAGGTG GCTTCCAGCA ACTTCTACTG GAGCCCATGC TGAAATTCCA 
1201 CTACATGCTG AAGAAGCTGC AGCTGCATGA GGAGGAGTAT GTGCTGATGC 
1251 AGGCCATCTC CCTCTTCTCC CCAGACCGCC CAGGTGTGCT GCAGCACCGC 
1301 GTGGTGGACC AGCTGCAGGA GCAATTCGCC ATTACTCTGA AGTCCTACAT 
1351 TGAATGCAAT CGGCCCCAGC CTGCTCATAG GTTCTTGTTC CTGAAGATCA 
1401 TGGCTATGCT CACCGAGCTC CGCAGCATCA ATGCTCAGCA CACCCAGCGG 
1451 CTGCTGCGCA TCCAGGACAT ACACCCCTTT GCTACGCCCC TCATGCAGGA 
1501 GTTGTTCGGC ATCACAGGTA GCTGAGCGGC TGCCCTTGGG TGACACCTCC 
1551 GAGAGGCAGC CAGACCCAGA GCCCTCTGAG CCGCCACTCC CGGGCCAAGA 
1601 CAGATGGACA CTGCCAAGAG CCGACAATGC CCTGCTGGCC TGTCTCCCTA 
1651 GGGAATTCCT GCTATGACAG CTGGCTAGCA TTCCTCAGGA AGGACATGGG 
1701 TGCCCCCCAC CCCCAGTTCA GTCTGTAGGG AGTGAAGCCA CAGATTCTTA 
1751 CGTGGAGAGT GCACTGACCT GTAGGTCAGG ACCATCAGAG AGGCAAGGTT 
1801 GCCCTTTCCT TTTAAAAGGC CCTGTGGTCT GGGGAGAAAT CCCTCAGATC 
1851 CCACTAAAGT GTCAAGGTGT GGAAGGGACC AAGCGACCAA GGATAGGCCA 
1901 TCTGGGGTCT ATGCCCACAT ACCCACGTTT GTTCGCTTCC TGAGTCTTTT 
1951 CATTGCTACC TCTAATAGTC CTGTCTCCCA CTTCCCACTC GTTCCCCTCC 
2001 TCTTCCGAGC TGCTTTGTGG GCTCCAGGCC TGTACTCATC GGCAGGTGCA 
2051 TGAGTATCTG TGGGAGTCCT CTAGAGAGAT GAGAAGCCAG GAGGCCTGCA 
2101 CCAAATGTCA GAAGCTTGGC ATGACCTCAT TCCGGCCACA TCATTCTGTG 
2151 TCTCTGCATC CATTTGAACA CATTATTAAG CACCGATAAT AGGTAGCCTG 

FIG.4B 
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2201 CTGTGGGGTA TACAGCATTG ACTCAGATAT AGATCCTGAG CTCACAGAGT 
2251 TTATAGTTAA AAAAACAAAC AGAAACACAA ACAATTTGGA TCAAAAGGAG 
2301 AAATGATAAG TGACAAAAGC AGCACAAGGA ATTTCCCTGT GTGGATGCTG 
2351 AGCTGTGATG GCGGGCACTG GGTACCCAAG TGAAGGTTCC CGAGGACATG 
2401 AGTCTGTAGG AGCAAGGGCA CAAACTGCAG CTGTGAGTGC GTGTGTGTGA 
2451 TTTGGTGTAG GTAGGTCTGT TTGCCACTTG ATGGGGCCTG GGTTTGTTCC 
2501 TGGGGCTGGA ATGCTGGGTA TGCTCTGTGA CAAGGCTACG CTGACAATCA 
2551 GTTAAACACA CCGGAGAAGA ACCATTTACA TGCACCTTAT ATTTCTGTGT 
2601 ACACATCTAT TCTCAAAGCT AAAGGGTATG AAAGTGCCTG CCTTGTTTAT 
2651 AGCCACTTGT GAGTAAAAAT TTTTTTGCAT TTTCACAAAT TATACTTTAT 
2701 ATAAGGCATT CCACACCTAA GAACTAGTTT TGGGAAATGT AGCCCTGGGT 
2751 TTAATGTCM ATCAAGGCAA AAGGAATTAA ATAATGTACT TTTGGCTAGA 
2801 GGGGTAAACT TTTTTGGCCT TTTTCTGGGG AAAATAATGT GGGGGTGTGG 
C5EQ ID N0:17) 

FIG.4C 
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TCrATCCTAATAr.GACTCACTATAGGGCTCGAGCGGCr^rr.rnnGrAGnTCTTTTGGCrT 60 

GCTGGGTTAGTGCTGGCAGCCCCCTGAGGCCAAGGACAGrAGGATGACAGTCACCAGGAr. 120 

M T V T R T 

TrACrACTTCAAGGAGGGGTCCCTCAGAGCAGrTGGCATArr rrTGrACAGTGrTGGGGr 180 
HHFKEGSLRAPAIPLHSAAA 

TGAGTTGGCTTCAAACCA TCCAAGAGGCCCAGAAGCAAACf.TGGAGGTGAGACCCAAAGA 240 
ELASNHPRGPEANLEVRPKE 

AAGCTGGAACCATGCTGACTTTGTACACTGTGAGGACACAGAGTCTGTTCCTGGAAAGCC 300 
SWNHADFVHCEDTESVPGKP 

CAGTGTCAACGCAGATGAGGAAGTCGGAGGTCCCCAAATCTGCCGTGTATGTGGGGACAA 360 
SVNADEEVGGPQI f R V C G D K 

GGCCACTGGCTATCACTTCAATGTCATGACATGTGAAGGATGCAAGGGCTTTTTCAGGAG 420 
ATGYHFNVMTCFGr. KGFFRR 

GGCCATGAAACGCAACGCCCGGCTGAGGTGCCCCTTCCGGAAGGGCGCCTGCGAGATCAC 480 
AMKRNARLRCPFRKGACEIT 

CCGGAAGACCCGGCGACAGTGCCAGGCCTGCCGCCTGCGCAAGTGCCTGGAGAGCGGCAT 540 
RKTRRQCQACRLRKCLESGM 

GAAGAAGGAGATGATCATGTCCGACGAGGCCGTGGAGGAGAGGCGGGCCTTGATCAAGCG 600 
KKEMI MSDEAVEERRAL I K R 

GAAGAAAAGTGAACGGACAGGGACTCAGCCACTGGGAGTGCAGGGGCTGACAGAGGAGCA 660 
KKSERTGTQPLGVQGLTEEQ 

GCGGATGATGATCAGGGAGCTGATGGACGCTCAGATGAAAACCTTTGACACTACCTTCTC 720 
RMM IREIMDAQMKTFDTTFS 

CCATTTCAAGAATTTCCGGCTGCCAGGGGTGCTTAGCAGTGGCTGCGAGTTGCCAGAGTC 780 
HFKNFRLPGVL5SGCELPES 

TCTGCAGGCCCCATCGAGGGAAGAAGCTGCCAAGTGGAGCCAGGTCCGGAAAGATCTGTG 840 
LOAPSREEAAKWSQVRKDLC 

CTCTTTGAAGGTCTCTCTGCAGCTGCGGGGGGAGGATGGCAGTGTCTGGAACTACAAACC 900 
SLKVSLQLRGEDGSVWNYKP 

CCCAGCCGACAGTGGCGGGAAAGAGATCTTCTCCCTGCTGCCCCACATGGCTGACATGTC 960 
PADSGGKEI FSLLPHMADMS 

AACCTACATGTTCAAAGGCATCATCAGCTTTGCCAAAGTCATCTCCTACTTCAGGGACTT 1020 
T Y M F K G I ISFAKVISYFRDL 

F1G.5A 
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GCCCATCGAGGACCAGATCTCCCTGCTGAAGGGGGCCGCTTTCGAGCTGTGTCAACTGAG 1080 
PIEDQISLLKGAAFELCQLR 

ATTCAACACAGTGTTCAACGCGGAGACTGGAACCTGGGAGTGTGGCCGGCTGTCCTACTG 1140 
FNTVFNAETGTWECGRLSYC 

CTTGGAAGACACTGCAGGTGGCTTCCAGCAACTTCTACTGGAGCCCATGCTGAAATTCCA 1200 
LEDTAGGFQQLLLEPMLKFH 

CTACATGCTGAAGAAGCTGCAGCTGCATGAGGAGGAGTATGTGCTGATGCAGGCCATCTC 1260 
YMLKKLQLHEEEYVLMQA IS 

CCTCTTCTCCCCAGACCGCCCAGGTGTGCTGCAGCACCGCGTGGTGGACCAGCTGCAGGA 1320 
LFSPDRPGVLQHRVVDQLOE 

GCAATTCGCCATTACTCTGAAGTCCTACATTGAATGCAATCGGCCCCAGCCTGCTCATAG 1380 
QFAITLKSY I ECNRPQPAHR 

GTTCTTGTTCCTGAAGATCATGGCTATGCTCACCGAGCTCCGCAGCATCAATGCTCAGCA 1440 
FLFLKIMAMLTELRSI NAQH 

CACCCAGCGGCTGCTGCGCATCCAGGACATACACCCCTTTGCTACGCCCCTCATGCAGGA 1500 
TQRLLR1QDIHPFATPLMQE 

GTTGTTCGGCATCACAGGTAGCTGAGCGGCTGCCCTTGGGTGACACCTCCGAGAGGCAGC 1560 
L F G I T G S CSEQ ID NO:18) 

CAGACCCAGAGCCCTCTGAGCCGCCACTCCCGGGCCAAGACAGATGGACACTGCCAAGAG 1620 

CCGACAATGCCCTGCTGGCCTGTCTCCCTAGGGAATTCCTGCTATGACAGCTGGCTAGCA 1680 

TTCCTCAGGAAGGACATGGGTGCCCCCCACCCCCAGTTCAGTCTGTAGGGAGTGAAGCCA 1740 

CAGATTCTTACGTGGAGAGTGCACTGACCTGTAGGTCAGGACCATCAGAGAGGCAAGGTT 1800 

GCCCTTTCCTTnAAMGGCCCTGTGGTCTGGGGAGAAATCCCTCAGATCCCACTAAAGT 1860 

GTCAAGGTGTGGAAGGGACCAAGCGACCAAGGATAGGCCATCTGGGGTCTATGCCCACAT 1920 

ACCCACGTTTGTTCGCTTCCTGAGTCTTTTCATTGCTACCTCTAATAGTCCTGTCTCCCA 1980 

CTTCCCACTCGTTCCCCTCCTCTTCCGAGCTGCTTTGTGGGCTCCAGGCCTGTACTCATC 2040 

GGCAGGTGCATGAGTATCTGTGGGAGTCCTCTAGAGAGATGAGAAGCCAGGAGGCCTGCA 2100 

CCAAATGTCAGAAGCTTGGCATGACCTCATTCCGGCCACATCATTCTGTGTCTCTGCATC 2160 

CATTTGAACACATTATTAAGCACCGATAATAGGTAGCCTGCTGTGGGGTATACAGCATTG 2220 

FIG.5B 
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ACTCAGATATAGATCCTGAGCTCACAGAGTTTATAGTTAAAAAAACAAACAGAAACACAA 2280 
ACAATTTGGATCAAAAGGAGAAATGATAAGTGACAAAAGCAGCACAAGGAATTTCCCTGT 2340 
GTGGATGCTGAGCTGTGATGGCGGGCACTGGGTACCCAAGTGAAGGTTCCCGAGGACATG 2400 
AGTCTGTAGGAGCAAGGGCACAAACTGCAGCTGTGAGTGCGTGTGTGTGATTTGGTGTAG 2460 
GTAGGTCTGTTTGCCACTTGATGGGGCCTGGGTTTGTTCCTGGGGCTGGAATGCTGGGTA 2520 
TGCTCTGTGACAAGGCTACGCTGACAATCAGTTAAACACACCGGAGAAGAACCATTTACA 2580 
TGCACCTTATATTTCTGTGTACACATCTATTCTCAAAGCTAAAGGGTATGAAAGTGCCTG 2640 
CCTTGTTTATAGCCACTTGTGAGTAAAAATTTTTTTGCATTTTCACAAATTATACTTTAT 2700 
ATAAGGCATTCCACACCTAAGAACTAGTTTTGGGAAATGTAGCCCTGGGTTTAATGTCAA 2760 
ATCAAGGCAAAAGGAATTAAATAATGTACTTTTGGCTAGAGGGGTAAACI ! i I I TGGCCT 2820 
TTTTCTGGGGAAAATAATGTGGGGGTGTGG (SEQ ID NO: 17) 2850 

FIG.5C 
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1 MTVTRTHHFK FGS1 RAPAIP LHSAAAELAS NH PRGPEANL EVRPKESWNH 

51 ADFVHCEDTE SVPGKPSVNA DEEVGGPQIC RVCGDKATGY HFNVMTCEGC 

101 KGFFRRAMKR NARLRCPFRK GACEITRKTR RQCQACRLRK CLESGMKKEM 

151 IMSDEAVEER RALIKRKKSE RTGTQPLGVQ GLTEEQRMMI RELMDAQMKT 

201 FDTTFSHFKN FRLPGVLSSG CELPESLQAP SREEAAKWSQ VRKDLCSLKV 

251 SLQLRGEDGS VWNYKPPADS GGKE1F5LLP HMADMSTYMF KGIISFAKVI 

301 SYFRDLPIED QISLLKGAAF ELCQLRFNTV FNAETGTWEC GRLSYCLEDT 

351 AGGFQQLLLE PMLKFHYMLK KLQLHEEEYV LMOAISLFSP DRPGVLQHRV 

901 VDQLQEQFAI TLKSYIECNR PQPAHRFLFL KIMAMLTELR SINAQHTQRL 
451 LRIQDIHPFA TPLMQELFGI TGS CSEQ ID NO: 18) 

FIG. 6 
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SEQUENCE LISTING 

<110> Merck & Co., Inc. 

<120> DNA MOLECULES ENCODING HUMAN NUCLEAR 
RECEPTOR PROTEINS, nNR-7 AND nNR7-l 

<130> 20084Y PCT 
<160> 24 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 3093 
<212> DNA 

<213> Homo sap i en (human) 
<400> 1 

tacgccaagc tcgaaattaa ccctcactaa agggaacaaa agctggagct ccaccgcggt 
ggcggccgct ctagaactag tggatccccc gggctgcagg aattcgaatt ctcataacct 
atgactagga cgggaagagg aagcactgcc tttacttcag tgggaatctc ggcctcagcc 
tgcaagccaa gtgttcacag tgagaaaagc aagagaataa gctaatactc ctgtcctgaa 
caaggcagcg gctccttggt aaagctactc cttgatcgat cctttgcacc ggattgttca 
aagtggaccc caggggagaa gtcggagcaa agaacttacc accaagcagt ccaagaggcc 
cagaagcaaa cctggaggtg agacccaaag aaagctggaa ccatgctgac tttgtacact 
gtgaggacac agagtctgtt cctggaaagc ccagcgtcaa cgcagatgag gaagtcggag 
gtccccaaat ctgccgtgta tgtggggaca aggccactgg ctatcacttc aatgtcatga 
catgtgaagg atgcaagggc tttttcagga gggccacgaa acgcaacgcc cggctgaggt 
gccccttccg gaagggcgcc tgcgagatca cccggaagac ccggcgacag tgccaggcct 
gccgcctgcg caagtgcctg gagagcggca tgaagaagga gatgatcatg tccgacgagg 
ccgtggagga gaggcgggcc ttgatcaagc ggaagaaaag tgaacggaca gggactcagc 
cactgggagt gcaggggctg acagaggagc agcggatgat gatcagggag ctgatggacg 
ctcagatgaa aacctttgac actaccttct cccatttcaa gaatttccgg ctgccagggg 
tgctcagcag tggctgcgag ttgccagagt ctctgcaggc cccatcgagg gaagaagctg 
ccaagtggag ccaggtccgg aaagatctgt gctctttgaa ggtctctctg cagctgcggg 
gggaggatgg cagtgtctgg aactacaaac ccccagccga cagtggcggg aaagagatct 
tctccctgct gccccacatg gctgacatgt caacctacat gttcaaaggc atcatcagct 
ttgccaaagt catctcctac ttcagggact tgcccatcga ggaccagatc tccctgctga 
agggggccgc tttcgagctg tgtcaactga gattcaacac agtgttcaac gcggagactg 1260 
gaacctggga gtgtggccgg ctgtcctact gcttggaaga cactgcaggt ggcttccagc 1320 
aacttctact ggagcccatg ctgaaattcc actacatgct gaagaagctg cagctgcatg 111Qn 
aggaggagta tgtgctgatg caggccatct ccctcttctc cccagaccgc ccaggtgtgc 
tgcagcaccg cgtggtggac cagctgcagg agcaattcgc cattactctg aagtcctaca 
ttgaatgcaa tcggccccag cctgctcata ggttcttgtt cctgaagatc atggctatgc 
tcaccgagct ccgcagcatc aatgctcagc acacccagcg gctgctgcgc atccaggaca 
tacacccctt tgctacgccc ctcatgcagg agttgttcgg catcacaggt agctgagcgg 
ctgcccttgg gtgacacctc cgagaggcag ccagacccag agccctctga gccgccactc 1740 
ccgggccaag acagatggac actgccaaga gccgacaatg ccctgctggc ctgtctccct 1 Q 
agggaattcc tgctatgaca gctggctagc attcctcagg aaggacatgg gtgcccccca 
cccccagttc agtctgtagg gagtgaagcc acagattctt acgtggagag tgcactgacc 
tgtaggtcag gaccatcaga gaggcaaggt tgccctttcc ttttaaaagg ccctgtggtc 
tggggagaaa tccctcagat cccactaaag tgtcaaggtg tggaagggac caagcgacca 
aggataggcc atctggggtc tatgcccaca tacccacgtt tgttcgcttc ctgagtcttt 
tcattgctac ctctaatagt ccrgtctccc acttcccact cgttcccctc ctcttccgag 

- 1- 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



1380 
1440 
1500 
1560 
1620 
1680 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
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ctgctttgtg ggctccaggc ctgtactcat 
tctagagaga tgagaagcca ggaggcctgc 
ttccggccac atcattctgt gtctctgcat 
taggtagcct gctgtggggt atacagcatt 
tttatagtta aaaaaacaaa cagaaacaca 
gtgacaaaag cagcacaagg aatttccctg 
gggtacccaa gtgaaggttc ccgaggacat 
gctgtgagtg cgtgtgtgtg atttggtgta 
gggtttgttc ctggggctgg aatgctgggt 
agttaaacac accggagaag aaccaCttac 
ttctcaaagc taaagggtat gaaagtgcct 
tttttttgca ttttcacaaa ttatacttta 
ttgggaaatg tagccctggg tttaatgtca 
ttttggctag aggggtaaac ttttttggcc 
ggaattcgaa ttcgatatca agcttatcga 
ccaattcgcc ctatagtgag tcgtattaca 

<210> 2 
<211> 466 
<212> PRT 

<213> Homo sapien (human) 
<400> 2 



Ser 


He 


Leu 


Cys 


Thr 


Gly 


Leu 


Phe 


Lys 


Val 


Asp 


Pro 


Arg 


Gly 


Glu 


Val 


1 






5 










10 










15 




Gly 


Ala 


Lys 


Asn 


Leu 


Pro 


Pro 


Ser 


Ser 


Pro 


Arg 


Gly 


Pro 


Glu 


Ala 


Asn 




20 










25 










30 






Leu 


Glu 


Val 


Arg 


Pro 


Lys 


Glu 


Ser 


Trp 


Asn 


His 


Ala 


Asp 


Phe 


Val 


His 






35 










40 










45 








Cys 


Glu 


Asp 


Thr 


Glu 


Ser 


Val 


Pro Gly 


Lys 


Pro 


Ser 


Val 


Asn 


Ala 


Asp 




50 










55 










60 










Glu 


Glu 


Val 


Gly 


Gly 


Pro 


Gin 


He 


Cys 


Arg 


Val 


Cys 


Gly 


Asp 


Lys 


Ala 


65 






70 










75 










80 


Thr Gly 


Tyr 


His 


Phe 


Asn 


Val 


Met 


Thr 


Cys 


Glu 


Gly 


Cys 


Lys 


Gly 


Phe 










85 










90 










95 




Phe 


Arg 


Arg 


Ala 


Met 


Lys 


Arg 


Asn 


Ala 


Arg 


Leu 


Arg 


Cys 


Pro 


Phe 


Arg 




100 










105 










110 






Lys 


Gly 


Ala 


Cys 


Glu 


He 


Thr 


Arg 


Lys 


Thr 


Arg 


Arg 


Gin 


Cys 


Gin 


Ala 






IIS 










120 










125 








cys 


Arg 


Leu 


Arg 


Lys 


Cys 


Leu 


Glu 


Ser 


Gly 


Met 


Lys 


Lys 


Glu 


Met 


He 




130 










135 










140 










Met 


Ser 


Asp 


Glu 


Ala 


Val 


Glu 


Glu 


Arg 


Arg 


Ala 


Leu 


He 


Lys 


Arg 


Lys 


145 








150 










155 










160 


Lys 


Ser 


Glu 


Arg 


Thr 


Gly 


Thr 


Gin 


Pro 


Leu 


Gly 


Val 


Gin 


Gly 


Leu 


Thr 






165 










170 










175 




Glu 


Glu 


Gin 


Arg 


Met 


Met 


He 


Arg 


Glu 


Leu 


Met 


Asp 


Ala 


Gin 


Met 


Lys 








180 










185 










190 






Thr 


Phe 


Asp 


Thr 


Thr 


Phe 


Ser 


His 


Phe 


Lys 


Asn 


Phe 


Arg 


Leu 


Pro 


Gly 






195 










200 










205 








Val 


Leu 


Ser 


Ser 


Gly 


Cys 


Glu 


Leu 


Pro 


Glu 


Ser 


Leu 


Gin 


Ala 


Pro 


Ser 




210 










215 










220 










Arg 


Glu 


Glu 


Ala 


Ala 


Lys 


Trp 


Ser 


Gin 


Val 


Arg 


Lys 


Asp 


Leu 


Cys 


Ser 


225 










230 










235 










240 


Leu 


Lys 


Val 


Ser 


Leu 


Gin 


Leu 


Arg 


Gly 


Glu 


Asp Gly 


Ser 


Val 


Trp 


Asn 








245 










250 










255 





cggcaggtgc atgagtatct 
accaaatgtc agaagcttgg 
ccatttgaac acattattaa 
gactcagata tagatcctga 
aacaatttgg atcaaaagga 
tgtggatgct gagctgtgat 
gagtctgtag gagcaagggc 
ggtaggtctg tttgccactt 
atgctctgtg acaaggctac 
atgcacctta tatttctgtg 
gccttgttta tagccacttg 
tataaggcat tccacaccta 
aatcaaggca aaaggaatta 
tttttctggg gaaaataatg 
taccgtcgac ctcgaggggg 
att 



gtgggagtcc 2220 

catgacctca 2280 

gcaccgataa 2340 

gctcacagag 2400 

gaaatgataa 2460 

ggcgggcact 2520 

acaaactgca 2580 

gatggggcct 2 640 

gctgacaatc 2700 

tacacatcta 2760 

tgagtaaaaa 282 0 

agaactagtt 2880 

aataatgtac 2940 

tgggggtgtg 3 000 

ggcccggtac 3060 
3093 
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<210> 3 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 3 
cttcaatgtc atgacatg 

<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 4 
ccaaatctgc cgtgtatgtg 

<210> 5 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Oligonucleotide 

<400> 5 
gtcagtgcac tctccacgt 

<210> 6 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 6 
tgcagctggt ccaccacgcg 

<210> 1 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 7 
gggtatgctc tgtgacaag 

<210> 8 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 8 
aggcaggcac tttcatacc 

<210> 9 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 9 
tttcgagctt ccaggttcat 

<210> 10 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Oligonucleotide 

<400> 10 
ctcccaaact ctgcctggtg 

<210> 11 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 11 
cgggagccac acttcaccat 

<210> 12 

<211> 20 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide 

<400> 12 
gctcacttct gcgctgtctg 

<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 13 
ttccgggctc ccagagtcat 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 14 
cagaagacct gcctgatctg 

<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide 
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<400> 15 
gaaatgaact ccttcatcat 

<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 16 
ccggatctgt ggggtgtgtg 

<210> 17 
<211> 2850 
<212> DNA 

<213> Homo sapien (human) 
<400> 17 

tccatcctaa tacgactcac tatagggctc gagcggccgc ccgggcaggt cttttggcct 
gctgggttag tgctggcagc cccctgaggc caaggacagc agcatgacag tcaccaggac 
tcaccacttc aaggaggggt ccctcagagc acctgccata cccctgcaca gtgctgcggc 
tgagttggct tcaaaccatc caagaggccc agaagcaaac ctggaggtga gacccaaaga 
aagctggaac catgctgact ttgtacactg tgaggacaca gagtctgttc ctggaaagcc 



20 



20 



60 
120 
ISO 
240 
300 



cagtgtcaac gcagatgagg aagtcggagg tccccaaatc tgccgtgtat gtggggacaa 3 60 

ggccactggc tatcacttca atgtcatgac atgtgaagga tgcaagggct ttttcaggag 420 
ggccatgaaa cgcaacgccc ggctgaggtg ccccttccgg aagggcgcct gcgagatcac 480 
ccggaagacc cggcgacagt gccaggcctg ccgcctgcgc aagtgcctgg agagcggcat 540 
gaagaaggag atgatcatgt ccgacgaggc cgtggaggag aggcgggcct tgatcaagcg 600 
gaagaaaagt gaacggacag ggactcagcc actgggagtg caggggctga cagaggagca 660 
gcggatgatg atcagggagc tgatggacgc tcagatgaaa acctttgaca ctaccttctc 720 
ccatttcaag aatttccggc tgccaggggt gcttagcagt ggctgcgagt tgccagagtc 
tctgcaggcc ccatcgaggg aagaagctgc caagtggagc caggtccgga aagatctgtg 
ctctttgaag gtctctctgc agctgcgggg ggaggatggc agtgtctgga actacaaacc 
cccagccgac agtggcggga aagagatctt ctccctgctg ccccacatgg ctgacatgtc 
aacctacatg ttcaaaggca tcatcagctt cgccaaagtc atctcctact tcagggactt 
gcccatcgag gaccagatct ccctgctgaa gggggccgct ttcgagctgt gtcaactgag 
attcaacaca gtgttcaacg cggagactgg aacctgggag tgtggccggc tgtcctactg 1140 



780 
840 
900 
960 
1020 
1080 



cttggaagac actgcaggtg gcttccagca acttctactg gagcccatgc tgaaattcca 

ctacatgctg aagaagctgc agctgcatga ggaggagtat gtgctgatgc aggccatctc 1260 

cctcttctcc ccagaccgcc caggtgtgct gcagcaccgc gtggtggacc agctgcagga 1320 
gcaattcgcc attactctga agtcctacat tgaatgcaat cggccccagc ctgctcatag 
gttcttgttc ctgaagatca tggctatgct caccgagctc cgcagcatca atgctcagca 
cacccagcgg ctgctgcgca tccaggacat acaccccttt gctacgcccc tcatgcagga 

gttgttcggc atcacaggta gctgagcggc tgcccttggg tgacacctcc gagaggcagc 1560 

cagacccaga gccctctgag ccgccactcc cgggccaaga cagatggaca ctgccaagag 1620 

ccgacaatgc cctgctggcc tgtctcccta gggaattcct gctatgacag ctggctagca 1680 

ttcctcagga aggacatggg tgccccccac ccccagttca gtctgtaggg agtgaagcca 1740 

cagattctta cgtggagagt gcactgacct gtaggtcagg accatcagag aggcaaggtt 1800 

gccctttcct tttaaaaggc cctgtggtct ggggagaaat ccctcagatc ccactaaagt 1860 

gtcaaggtgt ggaagggacc aagcgaccaa ggataggcca tctggggtct atgcccacat 1920 

acccacgttt gttcgcttcc tgagtctttt cattgctacc tctaatagtc ctgtctccca 1980 

cttcccactc gttcccctcc tcttccgagc tgctttgtgg gctccaggcc tgtactcatc 2040 

ggcaggtgca tgagtatctg tgggagtcct ctagagagat gagaagccag gaggcctgca 2100 



1380 
1440 
1500 
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ccaaatgtca 
catttgaaca 
actcagatat 
acaatttgga 
gtggatgctg 
agtctgtagg 
gtaggtctgt 
tgctctgtga 
tgcaccttat 
ccttgtttat 
ataaggcatt 
atcaaggcaa 
ttttctgggg 



gaagcttggc 
cattattaag 
agatcctgag 
tcaaaaggag 
agctgtgatg 
agcaagggca 
ttgccacttg 
caaggctacg 
atttctgtgt 
agccacttgt 
ccacacctaa 
aaggaattaa 
aaaataatgt 



atgacctcat 
caccgataat 
ctcacagagt 
aaatgataag 
gcgggcactg 
caaactgcag 
atggggcctg 
ctgacaatca 
acacatctat 
gagtaaaaat 
gaactagttt 
ataatgtact 
gggggtgtgg 



tccggccaca 
aggtagcctg 
ttatagttaa 
tgacaaaagc 
ggtacccaag 
ctgtgagtgc 
ggtttgttcc 
gttaaacaca 
tctcaaagct 
ttttttgcat 
tgggaaatgt 
tttggctaga 



tcattctgtg 
ctgtggggta 
aaaaacaaac 
agcacaagga 
tgaaggttcc 
gtgtgtgtga 
tggggctgga 
ccggagaaga 
aaagggtatg 
tttcacaaat 
agccctgggt 
ggggtaaact 



tctctgcatc 
tacagcattg 
agaaacacaa 
atttccctgt 
cgaggacatg 
tttggtgtag 
atgctgggta 
accatttaca 
aaagtgcctg 
tatactttat 
ttaatgtcaa 
tttttggcct 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2850 
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<210> 19 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 19 
aagcccttgc atccttcaca 

<210> 20 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 20 
ccatcctaat acgactcact atagggc 

<210> 21 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
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<400> 21 

gtaccgagct cggatccact a 21 

<210> 22 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 22 

ccgccagtgt gatggatatc t 21 

<210> 23 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 23 

ctcatctgcg ttgacactgg g 21 

<210> 24 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 



<400> 24 
tgaactcaaa ggaggtca 
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